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METHODS FOR REGULATING VASCULARIZATION USING 

GEF CONTAINING NEK-LIKE KINASE (GNK) 
C R OSS- R E FE RE N C F TO RELAT ED APPLICATION 
The present application claims the benefit of U.S. Provisional Application No. 
5 60/1 13,003, filed December 18. 1998, which is hereby incorporated by reference. 

FTF.I D OF THE INVENTION 
The invention is generally directed toward the use of a novel protein kinase, 
GEF containing NEK-like Kinase (GNK), previously designated [L-l/TNF-cc 
activated kinase (ITAK), and its physiological substrate, sGNK, in regulating 
1 0 vascularization. More specifically, the invention is directed to stimulating blood 
vessel development using the GNK and its agonists, and to inhibiting inappropriate 
blood vessel development using antagonists of GNK. 

BACKGROUND OF THE INVENTION 
The protein kinases regulate many different cell cycle, differentiation and 
1 5 signaling processes by catalyzing the addition of phosphate groups to protein 

substrates. Reversible protein phosphorylation is the main mechanism for regulating 
eukaryotic cell activities. Proteins are generally activated by the kinase-catalyzed 
transfer of high energy phosphate from adenosine triphosphate (ATP) or guanine 
triphosphate (GTP), referred to as phosphorylation, and deactivated by the reverse 
20 process, referred to as dephosphorylation, the removal of phosphate group from 

activated proteins by enzymes known as protein phosphatases. While some kinases 
act on a single substrate to bring about their biological effect, others are involved in 
complex biological networks or signaling pathways in which kinase-catalyzed 
phosphorylation triggers a cascade effect with multiple "downstream" events, which 
25 may include the activation of additional kinases. 

There are three primary types of kinases, categorized by the amino acid 
residue to which they catalytically transfer a phosphate group. Serine/threonine 
kinases transfer phosphate molecules to the alcoholic moiety of either serine or 
threonine residues within a polypeptide. Tyrosine kinases catalyze the transfer of 
30 phosphate to the phenolic moiety of tyrosine residues. Dual specificity kinases are 
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capable of catalyzing the transfer of phosphate to serine, threonine, or tyrosine 

residues within a polypeptide. 

Kinases can respond to extracellular signals, such, as hormones, growth 
factors, pheromones. cytokines, or neurotransmitters. These extracellular signaling 
5 molecules, which allow cell-to-cell signaling, bind to specific receptors on the cell 
membrane, in the cytosol or in the nucleus (e.g.. lipophilic hormones), forming 
receptor-ligand complexes. Kinase activity is also induced in response to 
environmental conditions such as ultraviolet radiation or stress, or in response to 
cell-cycle stimuli. 

1 0 When induced, kinases activate a variety of substrate molecules including 

enzymes, regulatory proteins, receptors, cytoskeletal proteins, transcription factors, 
ion channels and pumps. There are also kinases which are capable of phosphorylating 
themselves, a process known as autophosphorylation. In all forms of 
phosphorylation, the biological activity of each substrate is altered as the result of 
1 5 phosphorylation. Phosphorylated substrate molecules generally remain active until 
they are "turned off by phosphatases which dephosphorylate them. 

Protein kinases play a significant role in both B-cell and T-cell activation, as 
well as many phases of the immune response. The biological activity of many 
cytokines, including interleukin 1 (IL-1) and tumor necrosis factor (TNF), depend 
20 heavily on kinase-catalyzed protein phosphorylation. The binding of either of these 
cytokines to their respective receptors is known to induce rapid phosphorylation of 
several cytosolic proteins, such as the inhibitor of nuclear factor kappa B (NF-kB), 
heat shock protein 27 (hsp27), and mitogen-activated protein kinases (MAPK). 
(Geusdon et al., J. Biol. Chem. 272:30017, 1997). 
25 Protein kinases have also been shown to be significantly involved in cell cycle 

regulation. The centrosome, which plays a key role in cell division, undergoes a series 
of morphological and functional changes during the cell cycle. Centrosomes, which 
gives rise to the poles of the mitotic spindle apparatus, consist of a pair of centrioles 
surrounded by an amorphous structure known as pericentriolar material (PCM) from 
30 which microtubules are nucleated. In late G,/S phase of the cell cycle, centrioles 
undergo semi-conservative replication. During the S and G 2 phases, the centrosome 
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enlarges and "matures" as additional PCM proteins are recruited. The duplicated 
centrosomes physically separate and migrate to opposite ends of the nucleus at the 
transition of the G 2 to M phases. Additionally, the centrosomes abruptly increase 
their microtubule-nucleating capacity at the onset of mitosis. Many of these events 
5 are believe to be the result of kinase-catalyzed phosphorylation of critical centrosomal 
proteins, as protein kinases have been implicated in the duplication, maturation and 
separation of centrosomes during the cell cycle and are thought to regulate the 
centrosomal microtubule nucleation capacity. (Fry et al., J. Cell Biol., 141:1563, 
1998). 

10 As our understanding of kinases and signal transduction pathways increases, 

means for interceding in the progression of certain disease states are beginning to 
emerge. Once a kinase and its substrate are identified, it will be possible to either 
inhibit activity through the use of an antagonist or to enhance the kinase's activity 
using an agonist. Kinase activity may also be enhanced by increasing expression of 
1 5 the kinase gene or by the addition of exogenous kinase. Through activating or 
inhibiting kinase activity, biological effects can be regulated such that many 
pathologic conditions may be improved or remedied. 

One area in which the modulation of kinase activity may play a role is in the 
vascular system. The regulated development and maintenance of a functional 
20 vascular system is essential for fetal and post-natal life. For example, mouse 

mutations that block or compromise vasculogenesis (i.e., the development of vessels 
from vascular progenitor cells) or angiogenesis (i.e., the formation of capillaries from 
pre-existing vessels) generally result in embryonic lethality at various stages of 
development between e8.5 and el3.5 (days of embryonic development). During adult 
25 life, the regulation of vascularization is critical in normal organ homeostasis and 
during wound repair. In contrast, excessive vascularization is associated with, and 
contributes to, a number of inflammatory disorders, including arthritis, psoriasis and 
diabetic retinopathy. Additionally, the survival of tumors beyond a finite size is 
strictly dependent upon recruitment of blood vessels into a tumor site. Thus agents 
30 that promote or attenuate blood vessel development have multiple applications in the 
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treatment of vascular disorders or diseases in which dysregulated vascularization 
plays a critical role in pathogenesis. 

Consequently, there is a continuing need for substances and methods for 
regulating vascularization processes. The discovery that the novel kinase, GNK, and 
5 by implication its physiological substrate sGNK, affects vascular development now 
provides us with a means for regulating vascularization. The present invention 
provides methods for regulating vascularization, including isolated novel kinase and 
sGNK as well as agonists and antagonists of their biological activity. 

SUMMARY OF THE [NVT\TIO\ 
10 The present invention provides a novel use for GNK in regulating 

vascularization, for example, in treating pathological conditions related to 
oncogenesis or in circumstances where it is important to induce vasculogenesis. 
Methods for treating vascularization disorders, both undervascularization and 
inappropriate blood vessel development, using GNK, its agonists and antagonists are 
1 5 disclosed. The present invention also provides isolated DNA encoding sGNK, 

expression vectors comprising the isolated DNA, and a method for producing sGNK 
. by cultivating host cells containing the expression vectors under conditions 
appropriate for expressing sGNK. Antibodies directed against sGNK, or an 
immunogenic fragment thereof, are also disclosed. The sGNK, which is a 
20 physiological substrate of GNK and co-purifies with GNK through an ammonium 
sulphate precipitation and seven subsequent chromatographic purification steps, may 
also be useful in treating vascularization abnormalities. 

The invention includes an isolated human nucleic acid molecule comprising 
the DNA sequence of SEQ ID NO: 1 and isolated polypeptides having the amino acid 
25 sequence of SEQ ID NO: 2 and variants thereof due to the addition, deletion, or 

substitution of one or more amino acids. The invention also encompasses nucleic acid 
molecules that hybridize with the DNA sequence of SEQ ID NO: 1. A preferred set 
of hybridization conditions are those of moderate stringency, i.e., in 50% formamide 
and 6 x SSC, at 42 °C with washing conditions of 0.5 x SSC, 0.1% sodium dodecyl 
30 sulfate (SDS) at 60°C. 
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The present invention also encompasses an isolated human nucleic acid 

sequence encoding a sufficient number of amino acids of SEQ ID NO: 4 to confer on 
a GNK polypeptide the potential to regulate vascularization in mammals and an 
isolated human nucleic acid molecule comprising a sufficient number of nucleotides 
5 from SEQ ID NO: 3 to encode a GNK polypeptide that enhances vascularization. 

Expression vectors comprising the sGNK DNA sequences are provided, as 
well as methods for producing recombinant sGNK by culturing host cells under 
conditions appropriate for expressing sGNK. and for expressing a polypeptide having 
vascularization regulatory activity are provided. 

10 The present invention also provides methods for identifying a compound that 

modulates GNK-sGNK interaction or phosphorylation of sGNK by GNK. These 
methods comprise contacting candidate compounds with GNK and sGNK under 
conditions that allow the interaction or the phosphorylation to occur and then 
measuring the ability of the candidate compound to modulate interaction between 

1 5 GNK and sGNK or phosphorylation of sGNK by GNK. Compounds identified by 
these methods will be useful for further study and may have many in vivo and in vitro 
applications. 

Also provided is a method of identifying a compound that modulates 

vascularization comprising contacting a candidate compound with GNK or sGNK and 
20 measuring the ability of the compound to modulate a biological activity of GNK or 

sGNK. Compounds identified by this method will be useful for further study and may 

have many in vivo and in vitro applications. 

The present invention also encompasses nonhuman transgenic embryos, 

fetuses, and animals that are heterozygous for a GNK targeted mutation, nonhuman 
25 GNK-deficient embryos and fetuses produced by crossing such heterozygous animals, 

and cells from these embryos, fetuses, and animals. Cells deficient in GNK or sGNK 

are also provided. 

Additional features and advantages of the invention will be set forth in the 
description which follows, and in part will be apparent from the description, or may 
30 be learned by practice of the invention. The objectives and other advantages of the 
invention will be realized and attained by the methods, the recombinant vectors and 
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proteins, and the pharmaceutical compositions particularly pointed out in the written 

description and claims hereof, as well as the appended drawings. 

It is to be understood that both the foregoing general description and the 
following detailed description are exemplary and explanatory and are intended to 
provide further explanation of the invention as claimed. The accompanying figures 
are included to provide a further understanding of the invention and are incorporated 
in and constitute a pan of this specification. These figures illustrate several 
embodiments of the invention and, together with the description, serve to explain the 
principles of the invention. 

Throughout this specification many documents are cited. All of these 
documents are hereby specifically incorporated by reference. 

BRIEF DESCRIPTION OF THE DRAWINGS 

This invention will be more fully described with reference to the drawings in 

which: 

Figure 1 is the nucleotide sequence of sGNK, SEQ ID NO: 1, including the 
coding sequence (base pair 75-2549), the 5' untranslated region (UTR) and the 3' 
UTR; 

Figure 2 is the predicted amino acid sequence of sGNK, SEQ ID NO: 2; 

Figure 3 is the nucleotide sequence of GNK, SEQ ID NO: 3; 

Figure 4 is the predicted amino acid sequence of GNK, SEQ ID NO: 4; 

Figure 5 is the schematic representation of the domains of GNK and sGNK 
that depicts for GNK the serine/threonine kinase domain, the GEF homology domain, 
the gly/glu-rich linker, and the unique region, and for sGNK, predicted regions of 
moderate or high coiled-coil probability; 

Figure 6 is a tabular representation of the domains and structural features of 
GNK, SEQ ID NO: 4, that depicts the putative kinase domain (residues 44-315), the 
GEF homology domain (residues 318-605), the glycine/acidic-rich tether (residues 
752-764; also referred to as the gly/glu-rich linker), and the unique carboxy-terminal 
domain (residues 765-979); 
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Figure 7 is a sequence alignment of sGNK, SEQ ID NO: 2, with a human 

homolog of DrosopluLi Bicaudal-D, SEQ ID NO: IS. and the human centrosomai 
protein C-Napl, SEQ ID NO: 19; 

Figure 8 is an autoradiogram of an SDS-PAGE gel demonstrating that 
5 sGNK is phosphorylated by GNK, which is also autophosphorylated; 

Figure 9 is a chromatographic profile, silver-stained polyacrylamide gel and an 
autoradiogram depicting the copurification of GNK and sGNK by HPLC on a 
microbore MonoQ column. 

Figure 10 is a map of the GNK genomic locus encoding exons 1 and 2, the 
10 homologous recombinant vector, and the positive control vector used in constructing a 
GNK gene targeting vector. 

Figure 1 1 shows the vascularization of yolk sacs from GNK sufficient 
(GNK+/+, Figure 1 1 A, top) and GNK deficient (GNK-/-, Figure 1 1B ? bottom) fetuses. 
DETAILED DESCRIPTION OF THE INVENTION 
15 Throughout the specification various documents, including articles, books, patents, 

and patent applications, are cited. All of these documents are hereby incorporated by 
reference. 

The nucleotide and amino acid sequence of GNK (identified therein as ITAK) 
were originally disclosed in U.S. Application No. 08/870,529, which is herein 

20 incorporated by reference. Subsequent characterization studies by the instant 

inventors have identified a novel role for GNK in angiogenesis or neovascularization. 
Additionally, a second polypeptide, sGNK, which is a physiological substrate of and 
co-purifies with GNK, has been characterized and its nucleotide and amino acid 
sequence are disclosed. The sequences of GNK and sGNK were individually 

25 compared to non-redundant protein and nucleotide database sequences (National Ctr. 
For Biotechnol. Information (NCBI), Bethesda, MD) using the BLAST algorithm 
(Altschul et ah, J. MoL Biol 275:403, 1990). 

GNK was found to contain an N-terminal kinase domain, followed by a 
domain homologous to the Guanine nucleotide Exchange Factor (GEF) family of 

30 proteins, a short glycine/acidic-rich tether region and a C-terminal domain of 

unknown function with no significant homology to any known sequences. {See Figs. 5 
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and 6). The sequence of sGNK, an approximately 90 kilodalton (kDa) protein 
predicted to have a high degree of coiled-coi! structure, is similar to the 
Drosopiula Bicaudal-D gene and has region of high homology with a newly 
discovered protein, C-Napl. (See Figures 5 and 7). 
5 GNK 

GNK is a protein kinase with an approximate molecular weight of 1 10 kDa. 
that will phosphorylate itself (autophosphorylation), sGNK, and possibly other 
undetermined physiological substrates under appropnate conditions. (See Fig. 8). 
Phosphorylated-GNK demonstrates a strong tendency to oligomers. Based on SDS- 
1 0 PAGE and Superdex 200 gel filtration chromatography analyses, phosphorylated- 
GNK forms tnmers and also higher-order complexes. 

The kinase domain of GNK is most similar to the NIMA family of kinases 
particularly Nek2 (NIMA-related kinase 2), a dual specificity kinase associated with 
regulation of the cell cycle. Nek2 associates with the centrosomes of all cells during 
15 all stages of the cell cycle and has been shown to be a bona fide component of the 
core centrosome. (Fry et al., EMBOJ. 17:470,1998). Overexpression of Nek2 
results in splitting of the centrosome, dispersal of centrosomal material, and 
interference with microtubule regrowth, which profoundly affects centrosome 
structure and activity. (Id.). It has been proposed that Nek2 plays a role in severing 
20 the connection between the two duplicated centrosomes prior to the onset of mitosk 
by phosphorylating centrosomal "glue" proteins. (Id.). 

The "GEF-hke" domain of GNK is located between the kinase domain and the 
glycine/acidic-rich tether region. GEF proteins are activators of the Ras superfamily 
of proteins. (Overbeck et al., Mol. Repro. and Dev., 42:468, 1995). Members of the 
25 Ras superfamily are critical downstream components in signal transduction pathways 
that are initiated by the binding of extracellular ligands to transmembrane receptors 
possessing tyrosine kinase activity. Ras superfamily proteins are GTPases, which 
bind and hydrolyze GTP. They have been shown to regulate a wide variety of cellular 
activities, such as cell proliferation and differentiation, cytoskeletal organization 
30 nuclear transport, and cell cycle regulation. Ras superfamily proteins are active when 
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GTP is bound and inactive when GDP is bound. GEFs are positive regulators of Ras 
activity, promoting the release of bound GDP and facilitating GTP binding. 

The giycine/acidic-rich tether is a scries of nine consecutive glycine residues 
followed by three glutamic acid residues and an aspartic acid residue. This region of 
5 thirteen contiguous amino acid residues is located at position 752-764 of the GNK 
amino acid sequence. (See Figure 6). This region is believed to serve as a molecular 
linker or separator that would effectively isolate the downstream novel C-terminal 
domain from the remainder of the molecule. 

To further characterize GNK, cells were generated that lacked functional 
10 GNK. Attempts to develop GNK-deficient, or "knock-out", mice by crossing GNK 
heterozygotes (GNK +/-) were unsuccessful as the homozygous phenotype was lethal. 
Viable GNK -/- fetuses were present at the expected frequency between e9.5-l 1.5, but 
were under-represented by el 3. 5. The GNK null fetuses were growth retarded, i.e., 
approximately 50% smaller than wild-type littermates by el3.5. The GNK-deficient 
1 5 fetuses also displayed reduced vascularization in the yolk sac and placenta, indicating 
that GNK plays a critical role in angiogenesis and vascular biology. GNK -/- murine 
fibroblasts were generated from viable el 1.5-13.5 GNK -/- fetuses. 

These data suggest that inhibitors of GNK may be useful in inhibiting 
vascularization (i.e., angiogenesis and vasculogenesis). Inhibitors of angiogenesis 
20 will be clinically beneficial in those cases where excessive blood vessel development 
is detrimental. For example, arresting vascularization may be useful in treating 
proliferative retinopathy, which can lead to vision loss in diabetics and premature 
infants. Angiogenesis inhibitors may arrest malignant tumor development at primary 
and secondary sites by reducing tumor vascularization. Additionally, angiogenesis 
25 inhibitors may be useful in limiting the development and spread of warts and benign 
tumors. In the absence of a blood supply, a tumor cannot grow beyond 1-2 mm in 
size. Such inhibitors may also be useful in treating other disorders associated with 
inappropriate blood vessel development, including arthritis and psoriasis. 

Activators of GNK may be useful in stimulating blood vessel development in 
30 those cases where this might be advantageous, for example, during wound repair and 
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cardiac dysfunction. GNK activators may also be useful in stimulating the 

revascularization of brain tissue following stroke. 
sGNK 

sGNK shows a high degree of sequence homology with the Bicaudal-D gene 
5 of Drosophila. The Bicaudal-D gene encodes a cytoskeleton-like coiled coil 

polypeptide with a leucine zipper and five a-helix domains. (Baens and Marynen, 
Genomics, 45:601. 1997). Mutations m bicaudal-D disrupt the cytoskeleton, interfere 
with messenger RNA (mRNA) sorting, and d 1S rupt the polarity of the developing 
embryo. (Id. ). A human homologue of bicaudal-D has recently been reported and 
1 0 there is evidence to suggest there may be additional human homologs. (Id.). 

sGNK also contains a region that is highly homologous to C-Napl, a novel 
centrosomal coiled coil protein that appears to be the substrate of Nek2. (Fry et al., J. 
Cell Biol. 141:1563, 1998). C-Napl, like Nek2, is a core component of the human 
centrosome, that associates with centrosomes independently of microtubules. (Id.). 
1 5 C-Nap I and Nek2 are known to co-localize in the centrosome and both have been 
detected in all cell types examined. (Id.). A recent model suggests that C-Napl may 
function as part of the centrosomal "glue", by linking the ends of centrioles to each 
other during interphase. C-Napl is believed to be phosphorylated by Nek2 at the 
onset of mitosis, causing C-Napl to depolymerize or degrade which in turn permits 
20 the centrosomes to split during mitosis. 

sGNK co-purifies with GNK, suggesting they may form a higher-order 
complex. sGNK is phosphorylated by GNK in vitro, suggesting an interaction similar 
to that seen with Nek2 and C-Napl . Activators and inhibitors of sGNK may thus be 
useful in enhancing or decreasing angiogenesis. Inhibitors or sGNK phosphorylation 
25 will block subsequent biological activities of the protein, such as interfering with cell 
division or differentiation or blocking a signaling pathway. Activators of sGNK are 
expected to enhance its biological properties. 
NUCLEIC AfTD MOLECULES 

In a particular embodiment, the invention relates to certain isolated nucleotide 
30 sequences that are free from contaminating endogenous material. A "nucleotide 

sequence" refers to a polynucleotide molecule in the form of a separate fragment or as 
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a component of a larger nucleic acid construct. The nucleic acid molecule has been 

derived from DNA or RNA isolated at ieast once in substantially pure form and in a 
quantity or concentration enabling identification, manipulation, and recovery of its 
component nucleotide sequences by standard biochemical methods (such as those 

5 outlined in Sambrook et al.. Molecular Cloning: A Laboratory Manual, 2nd sed., 

Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989). Such sequences are 
preferably provided and/or constructed in the form of an open reading frame 
uninterrupted by internal non-translated sequences, or nitrons, that are typically 
present in eukaryotic genes. Sequences of non-translated DNA can be present 5' or 3' 
0 from an open reading frame, where the same do not interfere with manipulation or 
expression of the coding region. 

Nucleic acid molecules of the invention include DNA in both single-stranded 
and double-stranded form, as well as the RNA complement thereof. DNA includes, 
for example, cDNA, genomic DNA, chemically synthesized DNA, DNA amplified by 

1 5 PCR, and combinations thereof The DNA molecules of the invention include full 
length genes as well as polynucleotides and fragments thereof Genomic DNA may 
be isolated by conventional techniques, e.g., using the cDNA of SEQ ID NO:l, SEQ 
ID NO:3, or suitable fragments thereof, as a probe. The GNK and sGNK nucleic 
acids of the invention are preferentially derived from human sources, but the 

20 invention includes those derived from other mammalian species, as well. 
Preferred Sequences 

Particularly preferred nucleotide sequences of the invention are SEQ ID NO: 1 
and SEQ ID NO: 3, which encode sGNK and GNK, respectively, as set forth above. 
A clone having the nucleotide sequence of SEQ ID NO: 1 was isolated as described in 

25 Example 2. The amino acid sequence encoded by the DNA of SEQ ID NO: 1 is 

shown in SEQ ID NO: 2. This sequence identifies sGNK, the physiological substrate 
of GNK. A clone having the nucleotide sequence of SEQ ID NO: 3 has also been 
isolated. The amino acid sequence encoded by the DNA of SEQ ID NO: 3 is shown 
in SEQ ID NO: 4. This sequence identifies GNK as a member of the protein kinase 

30 superfamily. 
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Additional Sequences 

Due to the known degeneracy of the genetic code, wherein more than one 
codon can encode the same amino acid, a DNA sequence can vary- from that shown in 
SEQ ID NO: 1 or SEQ ID NO: 3. and still encode a polypeptide having the amino 
5 acid sequence of SEQ ID NO: 2 or SEQ ID NO:4. respectively. Such variant DNA 
sequences can result from silent mutations (e.g., occurring during PCR amplification), 
or can be the product of deliberate mutagenesis of a native sequence. 

The invention thus provides isolated DNA sequences encoding polypeptides 
of the invention, selected from: (a) DNA comprising the nucleotide sequence of SEQ 
10 ID NO: 1 or SEQ ID NO: 3; (b) DNA encoding the polypeptides of SEQ ID NO: 2 or 
SEQ ID NO: 4; (c) DNA capable of hybridizing with a DNA of (a) or (b) under 
conditions of moderate stringency and which encodes polypeptides of the invention; 
(d) DNA capable of hybridizing with a DNA of (a) or (b) under conditions of high 
stringency and which encodes polypeptides of the invention; and (e) DNA which is 
1 5 degenerate as a result of the genetic code to a DNA defined in (a), (b), (c), or (d) and 
which encode polypeptides of the invention. Of course, polypeptides encoded by 
such DNA sequences are encompassed by the invention. 

As used herein, conditions of moderate stringency can be readily determined 
by those having ordinary skill in the art based on, for example, the length of the DNA. 
20 The basic conditions are set forth by Sambrook et al. Molecular Cloning: A 

Laboratory Manual, 2 ed. Vol. 1, pp. 1.101-104, Cold Spring Harbor Laboratory 
Press, (1989), and include use of apre-washing solution for the nitrocellulose filters 
5X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0), hybridization conditions of about 50% 
formamide, 6X SSC at about 42°C (or other similar hybridization solution, such as 
25 Stark's solution, in about 50% formamide at about 42'C), and washing conditions of 
about 60°C, 0.5X SSC, 0.1% SDS. Conditions of high stringency can also be readily 
determined by the skilled artisan based on, for example, the length of the DNA. 
Generally, such conditions are defined as hybridization conditions as above, and with 
washing at approximately 68"C, 0.2X SSC, 0.1% SDS. The skilled artisan will 
30 recognize that the temperature and wash solution salt concentration can be adjusted as 
necessary according to factors such as the length of the probe. 
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Also included as an embodiment of the invention is DNA encoding 
polvpeptidc fragments and polypeptides comprising inactivated site(s) for 
myristovlation, palmitoylation, prenylation (supporting the thioether linkage of a 
farnesyl or geranylgeranyl moiety) or glycosyl phosphatidylinositol (GPI) linkage, 
inactivated protease processing site(s), or conservative amino acid substitution(s). 

In another embodiment, the nucleic acid molecules of the invention also 
comprise nucleotide sequences that are at least 80% identical to a native sequence. 
Also contemplated are embodiments in which a nucleic acid molecule comprises a 
sequence that is at least 90% identical, at least 95% identical, at least 98% identical, at 
least 99% identical, or at least 99.9% identical to a native sequence. 

The percent identity may be determined by visual inspection and mathematical 
calculation. Alternatively, the percent identity of two nucleic acid sequences can be 
determined by comparing sequence information using the GAP computer program, 
version 6.0 described by Devereux et al. (NucL Acids Res. 12:387, 1984) and 
available from the University of Wisconsin Genetics Computer Group (UWGCG). 
The preferred default parameters for the GAP program include: (1) a unary 
comparison matrix (containing a value of 1 for identities and 0 for non-identities) for 
nucleotides, and the weighted comparison matrix of Gribskov and Burgess, NucL 
Acids Res. 14:6745, 1986, as described by Schwartz and Dayhoff, eds., Atlas of 
Protein Sequence and Structure, National Biomedical Research Foundation, pp. 353- 
358, 1979; (2) a penalty of 3.0 for each gap and an additional 0.10 penalty for each 
symbol in each gap; and (3) no penalty for end gaps. Other programs used by persons 
skilled in the art of sequence comparison, such as those employing the BLAST 
algorithm, may also be employed. 

The invention also provides isolated nucleic acids useful in the production of 
polypeptides. Such polypeptides may be prepared by any of a number of 
conventional techniques. The DNA sequence encoding the GNK or sGNK 
polypeptides, or desired fragments thereof, may be subcloned into an expression 
vector for production of the polypeptide or fragment. The DNA sequence 
advantageously is fused to a sequence encoding a suitable leader or signal peptide. 
Alternatively, the desired fragment may be chemically synthesized using known 
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techniques. DNA fragments also may be produced by restriction endonuclease 

digestion of a full length cloned DNA sequence, and isolated by electrophoresis on 
agarose gels. If necessary, oligonucleotides that reconstruct the 5' or 3' terminus to a 
desired point may be ligated to a DNA fragment generated by restriction enzyme 
5 digestion. Such oligonucleotides may additionally contain a restriction endonuclease 
cleavage site upstream of the desired coding sequence, and position an initiation 
codon (ATG) at the N-terminus of the coding sequence. 

The well-known polymerase chain reaction (PCR) procedure also may be 
employed to isolate and amplify a DNA sequence encoding a desired protein 
10 fragment. Oligonucleotides that define the desired termini of the DNA fragment are 
employed as 5' and 3' pnmers. The oligonucleotides may additionally contain 
recognition sites for restriction endonucleases, to facilitate insertion of the amplified 
DNA fragment into an expression vector. PCR techniques are described in Saiki et 
al., Science 239:487 (19SS); Recombinant DNA Methodology, Wu et al., eds., 
15 Academic Press, Inc., San Diego (1989), pp. 189-196; and PCR Protocols: A Guide 
to Methods and Applications, Innis et al., eds., Academic Press, Inc. (1990). 
POLYPEPTTOFS AND FRAGMENTS THEREOF 

The invention encompasses GNK and sGNK polypeptides and fragments 
thereof in various forms, including those that are naturally occurring or produced 
20 through various techniques such as procedures involving recombinant DNA 

technology. Such forms include, but are not limited to, derivatives, variants, and 
oligomers, as well as fusion proteins or fragments thereof. 
Polypeptides and Fragments Thereof 

The polypeptides of the invention include full length proteins encoded by the 
25 nucleic acid sequences set forth above. Polypeptide fragments of these nucleotide 
sequences are also intended to be within the scope of the invention. For example, a 
particular polypeptide fragment of GNK has been identified with kinase activity 
approximately 3-4 times higher than full length GNK. This particularly preferred 
fragment comprises amino acids 2 to 340 of SEQ ID NO: 4. 
30 The polypeptide of SEQ ID NO: 4 includes an N-terminal leader region of 43 

amino acids followed by a kinase domain comprising amino acids 44 to 3 1 5, a 
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Guanine nucleotide Exchange Factor (GEF) homology region comprising amino acids 

318 through 605. a short glycine/acidic-rich tether region comprising amino acids 752 
through 764 and a novel C-terminal cytoplasmic domain having no significant 
homology with any sequences in the computer data bases, comprising amino acids 
5 765 to 979. A spacer region comprises amino acids 606 to 75 1 . 

The skilled artisan will recognize that the above-described boundaries of such 
regions of the polypeptide are approximate and that the boundaries of the kinase 
domain (which may be predicted by using computer programs available for that 
purpose) may differ from those described above. 

10 The polypeptides of the invention may be cytosolic or they may be genetically 

engineered to be secreted, i.e., capable of being secreted from the cells in which they 
are expressed. In general, secreted polypeptides may be identified (and distinguished 
from cytosolic counterparts) by separating intact cells which express the desired 
polypeptide from the culture medium, e.g., by centrifugation, and assaying the 

15 medium (supernatant) for the presence of the desired polypeptide. The presence of 
polypeptide in the medium indicates that the polypeptide was secreted from the cells. 

In general, the use of soluble forms is advantageous for certain applications. 
Purification of such polypeptides from recombinant host cells is facilitated, since they 
are secreted from the cells. Further, secreted polypeptides may be preferable for 

20 therapeutic administration. 

Also provided herein are polypeptide fragments comprising at least 20, or at 
least 30, contiguous amino acids of the sequence of SEQ ID NO: 2 or of SEQ ID NO: 
4. Fragments derived from these amino acid sequences find use in studies of signal 
transduction, in regulating cellular processes associated with transduction of 

25 biological signals, and in vascular biology studies. Polypeptide fragments may also 
be employed as immunogens, in generating antibodies. 
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Variants 

Naturally occurring variants as well as derived variants of the polypeptides 
and fragments are provided herein. 

Variants may exhibit amino acid sequences that are at least 80% identical. 
5 Also contemplated are embodiments in which a polypeptide or fragments thereof 
comprises an amino acid sequence that is at least 90% identical, at least 95% 
identical, at least 98% identical, at least 99% identical, or at least 99.9% identical to 
the preferred polypeptide or fragment thereof. Percent identity may be determined by 
visual inspection and mathematical calculation. Alternatively, the percent identity of 
1 0 two protein sequences can be determined by comparing sequence information using 
the GAP computer program, based on the algorithm of Needleman and Wunsch (J. 
Mol. Bio. 48:443, 1970) and available from the University of Wisconsin Genetics 
Computer Group (UWGCG). The preferred default parameters for the GAP program 
include: (1) a scoring matrix, blosum62, as described by Henikoff and Henikoff 
1 5 (Proc. Natl. Acad. Sci. USA 89: 1 09 1 5, 1 992); (2) a gap weight of 1 2; (3) a gap 
length weight of 4; and (4) no penalty for end gaps. Other programs used by one 
skilled in the art of sequence comparison may also be used. 

The variants of the invention include, for example, those that result from 
alternate mRNA splicing events or from proteolytic cleavage. Alternate splicing of 
20 mRNA may, for example, yield a truncated but biologically active protein, such as a 
naturally occurring soluble form of the protein or a variant lacking a regulatory 
component. Variations attributable to proteolysis include, for example, differences in 
the N- or C-termini upon expression in different types of host cells, due to proteolytic 
removal of one or more terminal amino acids from the protein (generally from 1-5 
25 terminal amino acids). Proteins in which differences in amino acid sequence are 

attributable to genetic polymorphism (allelic variation among individuals producing 
the protein) are also contemplated herein. 

Additional variants within the scope of the invention include polypeptides that 
may be modified to create derivatives thereof by forming covalent or aggregative 
30 conjugates with other chemical moieties, such as glycosyl groups, lipids, phosphate, 
acetyl groups and the like. Covalent derivatives may be prepared by linking the 
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chemical moieties to functional groups on amino acid side chains or at the N-terminus 

or C-terminus of a polypeptide. Conjugates comprising diagnostic (detectable) or 
therapeutic agents attached thereto are contemplated herein, as discussed in more 
detail below. 

5 Other derivatives include covalent or aggregative conjugates of the 

polypeptides with other proteins or polypeptides, such as by synthesis in recombinant 
culture as N-terminal or C-terminal fusions. Examples of fusion proteins are 
discussed below in connection with oligomers. Further, fusion proteins can comprise 
peptides added to facilitate purification and identification. Such peptides include, for 

10 example. poly-His or the antigenic identification peptides described in U.S. Patent 
No. 5,01 1,912 and in Hopp et al., Bio/Technology 6:1204, 1988. One such peptide is 
the FLAG (g> peptide, Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys, which is highly antigenic 
and provides an epitope reversibly bound by a specific monoclonal antibody, enabling 
rapid assay and facile purification of expressed recombinant protein. A murine 

15 hybridoma designated 4E1 1 produces a monoclonal antibody that binds the FLAG® 
peptide in the presence of certain divalent metal cations, as described in U.S. Patent 
5,01 1,912, hereby incorporated by reference. The 4E1 1 hybridoma cell line has been 
deposited with the American Type Culture Collection under accession no. HB 9259. 
Monoclonal antibodies that bind the FLAG® peptide are available from Eastman 

20 Kodak Co., Scientific Imaging Systems Division, New Haven, Connecticut. 

Among the variant polypeptides provided herein are variants of native 
polypeptides that retain the native biological activity or the substantial equivalent 
thereof. One example is a variant that binds with essentially the same binding affinity 
as does the native form. Binding affinity can be measured by conventional 

25 procedures, e.g., as described in U.S. Patent No. 5,5 12,457 and as set forth below. 

Variants include polypeptides that are substantially homologous to the native 
form, but which have an amino acid sequence different from that of the native form 
because of one or more deletions, insertions or substitutions. Particular embodiments 
include, but are not limited to, polypeptides that comprise from one to ten deletions, 

30 insertions or substitutions of amino acid residues, when compared to a native 
sequence. 
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A given amino acid may be replaced, for example, by a residue having similar 
physiochemical characteristics. Examples of such conservative substitutions include 
substitution of one aliphatic residue for another, such as He, Val, Leu, or Ala for one 
another; substitutions of one polar residue for another, such as between Lys and Arg. 
5 Glu and Asp, or Gin and Asn; or substitutions of one aromatic residue for another, 
such as Phe, Trp, or Tyr for one another. Other conservative substitutions, e.g., 
involving substitutions of entire regions having similar hydrophobicity characteristics, 
arc well known. 

Similarly, the DXAs of the invention include variants that differ from a native 
10 DNA sequence because of one or more deletions, insertions or substitutions, but that 
encode a biologically active polypeptide. 

The invention further includes polypeptides of the invention with or without 
associated native-pattern lipid anchors, e.g., myristoylation, palmitoylation, 
prenylation, etc. Post-translational modifications such as the enzyme-catalyzed 
15 addition of myristic or palmitic acid residues or GPI anchors cause the modified 
protein to become membrane-bound. 

Correspondingly, similar DNA constructs that encode various additions or 
substitutions of amino acid residues or sequences, or deletions of terminal or internal 
residues or sequences are encompassed by the invention. For example, myristylation 
20 addition sites in the polypeptide can be modified to preclude myristylation, allowing 
expression of a fatty acid- free analog in mammalian and yeast expression systems. 

In another example of variants, sequences encoding Cys residues that are not 
essential for biological activity can be altered to cause the Cys residues to be deleted 
or replaced with other amino acids, preventing formation of incorrect intramolecular 
25 disulfide bridges upon folding or renaturation. 

Other variants are prepared by modification of adjacent dibasic amino acid 
residues, to enhance expression in yeast systems in which KEX2 protease activity is 
present. EP 212,914 discloses the use of site-specific mutagenesis to inactivate KEX2 
protease processing sites in a protein. KEX2 protease processing sites are inactivated 
30 by deleting, adding or substituting residues to alter Arg-Arg, Arg-Lys, and Lys-Arg 
pairs to eliminate the occurrence of these adjacent basic residues. Lys-Lys pairings 
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are considerably less susceptible to KEX2 cleavage, and conversion of Arg-Lys or 

Lys-Arg to Lys-Lys represents a conservative and preferred approach to inactivating 
KEX2 sites. 

Oligomers 

5 Encompassed by the invention homo- and hetero-oligomers of GNK and/or 

sGNK. or fusion proteins that contain GNK and/or sGNK polypeptides. Such 
oligomers may be in the form of covalently-linked or non-covalently-linked 
multimers, including dirners. trimers, or higher oligomers. In one aspect of the 
invention, the oligomers maintain the binding ability of the polypeptide components 

10 and provide binding sites that are bivalent, trivalent, etc. 

One embodiment of the invention is directed to oligomers comprising multiple 
polypeptides joined via covalent or non-covalent interactions between peptide 
moieties fused to the polypeptides. Such peptides may be peptide linkers (spacers), or 
peptides that have the property of promoting oligomerization. Leucine zippers and 

15 certain polypeptides derived from antibodies are among the peptides that can promote 
oligomerization of the polypeptides attached thereto, as described in more detail 
below. 

Immunoglobulin-based Oligomers 
As one alternative, an oligomer is prepared using polypeptides derived from 
20 immunoglobulins. Preparation of fusion proteins comprising certain heterologous 
polypeptides fused to various portions of antibody-derived polypeptides (including 
the Fc domain) has been described, e.g., by Ashkenazi et al. (Proc. Natl. Acad. Set 
USA 88:10535, 1991); Byrn et al. {Nature 344:677, 1990); and Hollenbaugh and 
Aruffo ("Construction of Immunoglobulin Fusion Proteins", in Current Protocols in 
25 Immunology, Suppl. 4, pages 10.19.1 - 10.19.11, 1992). 

One embodiment of the present invention is directed to a dimer comprising 
two fusion proteins created by fusing a polypeptide of the invention to an Fc 
polypeptide derived from an antibody. A gene fusion encoding the polypeptide/Fc 
fusion protein is inserted into an appropriate expression vector. Polypeptide/Fc fusion 
30 proteins are expressed in host cells transformed with the recombinant expression 
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vector, and allowed to assemble much like antibody molecules, whereupon interchain 
disulfide bonds form between the Fc moieties to yield divalent molecules. 

The term f, Fc polypeptide" as used herein includes native and mutein forms of 
polypeptides made up of the Fc region of an antibody comprising any or all of the CH 
5 domains of the Fc region. Truncated forms of such polypeptides containing the hinge 
region that promotes dimerization are also included. Preferred polypeptides comprise 
an Fc polypeptide derived from a human IgGl antibody. 

One suitable Fc polypeptide, described in PCT application WO 93/10151 
(hereby incorporated by reference), is a single chain polypeptide extending from the 
10 N-terminal hinge region to the native C-terminus of the Fc region of a human IgGl 
antibody. Another useful Fc polypeptide is the Fc mutein described in U.S. Patent 
5,457,035 and in Baum et al., (EMBO J. 13:3992-4001, 1994) incorporated herein by 
reference. The amino acid sequence of this mutein is identical to that of the native Fc 
sequence presented in WO 93/1015 1, except that amino acid 19 has been changed 
15 from Leu to Ala, amino acid 20 has been changed from Leu to Glu, and amino acid 22 
has been changed from Gly to Ala. The mutein exhibits reduced affinity for Fc 
receptors. 

The above-described fusion proteins comprising Fc moieties (and oligomers 
formed therefrom) offer the advantage of facile purification by affinity 
20 chromatography using Protein A or Protein G columns. 

Peptide-linker Based Oligomers 
Alternatively, the oligomer is a fusion protein comprising multiple 
polypeptides, with or without peptide linkers (spacer peptides). Among the suitable 
peptide linkers are those described in U.S. Patents 4,751,180 and 4,935,233, which 
25 are hereby incorporated by reference. A DNA sequence encoding a desired peptide 
linker may be inserted between, and in the same reading frame as, the DNA sequences 
of the invention, using any suitable conventional technique. For example, a 
chemically synthesized oligonucleotide encoding the linker may be ligated between 
the sequences. In particular embodiments, a fusion protein comprises from two to 
30 four soluble GNK and/or sGNK polypeptides, separated by peptide linkers. 
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Leucine-Zippers 

Another method for preparing the oligomers of the invention involves use of a 
leucine zipper. Leucine zipper domains are peptides that promote oligomerization of 
the proteins in which they are found. Leucine zippers were originally identified in 
5 several DNA-binding proteins (Landschulz et al., Science 240:1759, 1988), and have 
since been found in a variety of different proteins. Among the known leucine zippers 
are naturally occurring peptides and derivatives thereof that dimerize or trimerize. 

The zipper domain (also referred to herein as an oligomerizing. or oligomer- 
forming, domain) comprises a repetitive heptad repeat, often with four or five leucine 

10 residues interspersed with other amino acids. Examples of zipper domains are those 
found in the yeast transcription factor GCN4 and a heat-stable DNA-binding protein 
found in rat liver (C/EBP; Landschulz et aL, Science 243: 1681, 1989). Two nuclear 
transforming proteins, fos and jun, also exhibit zipper domains, as does the gene 
product of the murine proto-oncogene, c-myc (Landschulz et ah, Science 240:1759, 

1 5 1988). The products of the nuclear oncogenes fos and jun comprise zipper domains 
that preferentially form heterodimer (O'Shea et aL, Science 245:646, 1989, Turner and 
Tjian, Science 243:1689, 1989). The zipper domain is necessary for biological 
activity (DNA binding) in these proteins. 

The fusogenic proteins of several different viruses, including paramyxovirus, 

20 coronavirus, measles virus and many retroviruses, also possess zipper domains 

(Buckland and Wild, Nature 338:547,1989; Britton, Nature 353:394, 1991; Delwart 
and Mosialos, AIDS Research and Human Retroviruses 6:703, 1990). The zipper 
domains in these fusogenic viral proteins are near the transmembrane region of the 
proteins; it has been suggested that the zipper domains could contribute to the 

25 oligomeric structure of the fusogenic proteins. Oligomerization of fusogenic viral 
proteins is involved in fusion pore formation (Spruce et al, Proc. Natl. Acad, Sci. 
U.S.A. 88:3523, 1991). Zipper domains have also been recently reported to play a 
role in oligomerization of heat-shock transcription factors (Rabindran et aL, Science 
259:230, 1993). 

30 Zipper domains fold as short, parallel coiled coils (O'Shea et aL, Science 

254:539; 1991). The general architecture of the parallel coiled coil has been well 
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10 



characterized, with a "knobs-into-holes" packing as proposed by Crick in 1953 (Acta 
Crystallogr. 6:689). The dirtier formed by a zipper domain is stabilized by the heptad 
repeat, designated (abc\iefg) n according to the notation of McLachlan and Stewart (J. 
Mol. Biol. 98:293; 19^5). in which residues a and d arc generally hydrophobic 
residues, with d being a leucine, which line up on the same face of a helix. 
Oppositely-charged residues commonly occur at positions g and e. Thus, in a parallel 
coiled coil formed from two helical zipper domains, the "knobs" formed by the 
hydrophobic side chains of the first helix are packed into the "holes" formed between 
the side chains of the second helix. 

The residues at position d (often leucine) contribute large hydrophobic 
stabilization energies, and are important for oligomer formation (Krystek: et al., Int. J. 
Peptide Res. 38:229, 1991). Lovejoy et al. (Science 259:1288, 1993) recently 
reported the synthesis of a triple-stranded cc-helical bundle in which the helices run 
up-up-down. Their studies confirmed that hydrophobic stabilization energy provides 
1 5 the main driving force for the formation of coiled coils from helical monomers. These 
studies also indicate that electrostatic interactions contribute to the stoichiometry and 
geometry of coiled coils. Further discussion of the structure of leucine zippers is 
found in Harbury et al. (Science 262:1401, 26 November 1993) 

Examples of leucine zipper domains suitable for producing soluble oligomeric 
proteins are described in PCT application WO 94/10308, and the leucine zipper 
derived from lung surfactant protein D (SPD) described in Hoppe et al. (FEBS Letters 
344:191, 1994), hereby incorporated by reference. The use of a modified leucine 
zipper that allows for stable trimerization of a heterologous protein fused thereto is 
described in Fanslow et al. (Semin. Immunol. 6:267-278, 1994). Recombinant fusion 
25 proteins comprising a soluble polypeptide fused to a leucine zipper peptide are 

expressed in suitable host cells, and the soluble oligomer that forms is recovered from 
the culture supernatant. 

Certain leucine zipper moieties preferentially form trimers. One example is a 
leucine zipper derived from lung surfactant protein D (SPD), as described in Hoppe et 
30 al. (FEBS Letters 344:191, 1994) and in U.S. Patent 5,716,805, hereby incorporated 
by reference in their entirety. This lung SPD-derived leucine zipper peptide 



20 
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comprises the amino acid sequence Pro Asp Val Ala Ser Leu Arg Gin Gin Val Glu 
Ala Leu Gin Gly Gin Val Gin His Leu Gin Ala Ala Phe Scr Gin Tyr. 

Another example of a leucine zipper that promotes trimerization is a peptide 
comprising the amino acid sequence .Arg Met Lys Gin He Glu Asp Lys He Glu Glu He 
5 Leu Ser Lys lie Tyr His lie Glu Asn Glu He Ala Arg He Lys Lys Leu He Gly Glu Arg, 
as described in U.S. Patent 5,716,805. In one alternative embodiment, an N-terminal 
Asp residue is added; in another, the peptide lacks the N-terminal Arg residue. 

Fragments of the foregoing zipper peptides that retain the property of 
promoting oligomerization may be employed as well. Examples of such fragments 
10 include, but are not limited to, peptides lacking one or two of the N-terminal or C- 
terminal residues presented in the foregoing amino acid sequences. Leucine zippers 
may be derived from naturally occurring leucine zipper peptides, e.g., via 
conservative substitution(s) in the native amino acid sequence, wherein the peptide's 
ability to promote oligomerization is retained. 
15 Other peptides derived from naturally occurring trimeric proteins may be 

employed in preparing trimeric GNK or sGNK preparations. Alternatively, synthetic 
peptides that promote oligomerization may be employed. In particular embodiments, 
leucine residues in a leucine zipper moiety are replaced by isoleucine residues. Such 
peptides comprising isoleucine may be referred to as isoleucine zippers, but are 
20 encompassed by the term "leucine zippers" as employed herein. 

PRODUCTION OF POLYPEPTIDES AND FRAGMENTS THEREOF 

Expression, isolation and purification of the polypeptides and fragments of the 
invention may be accomplished by any suitable technique, including but not limited to 
the following: 
25 Expression Systems 

The present invention also provides recombinant cloning and expression 
vectors containing DNA, as well as host cells containing the recombinant vectors. 
Expression vectors comprising DNA may be used to prepare the polypeptides or 
fragments of the invention encoded by the DNA. A method for producing 
30 polypeptides comprises culturing host cells transformed with a recombinant 

expression vector encoding the polypeptide, under conditions that promote expression 
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of the polypeptide, then recovering the expressed polypeptides from the culture. The 
skilled artisan will recognize that the procedure for purifying the expressed 
polypeptides will vary according 10 such factors as the type of host cells employed, 
and whether the polypeptide is membrane-bound or in a soluble, secreted form. 
5 Any suitable expression system may be employed. The vectors include a 

DNA encoding a polypeptide or fragment of the invention, operably linked to suitable 
transcriptional or translational regulatory nucleotide sequences, such as those derived 
from a mammalian, microbial, viral, or insect gene. Examples of regulatory 
sequences include transcriptional promoters, operators, or enhancers, an mRNA 
1 0 ribosomal binding site, and appropriate sequences which control transcription and 

translation initiation and termination. Nucleotide sequences are operably linked when 
the regulatory sequence functionally relates to the DNA sequence. Thus, a promoter 
nucleotide sequence is operably linked to a DNA sequence if the promoter nucleotide 
sequence controls the transcription of the DNA sequence. An origin of replication 
1 5 that confers the ability to replicate in the desired host cells, and a selection gene by 
which transformants are identified, are generally incorporated into the expression 
vector. 

In addition, a sequence encoding an appropriate signal peptide (native or 
heterologous) can be incorporated into expression vectors. A DNA sequence for a 

20 signal peptide (secretory leader) may be fused in frame to the nucleic acid sequence of 
the invention so that the DNA is initially transcribed, and the mRNA translated, into a 
fusion protein comprising the signal peptide. A signal peptide that is functional in the 
intended host cells promotes extracellular secretion of the polypeptide. The signal 
peptide is cleaved from the polypeptide upon secretion of polypeptide from the cell. 

25 The skilled artisan will also recognize that the position(s) at which the signal 

peptide is cleaved may differ from that predicted by computer program, and may vary 
according to such factors as the type of host cells employed in expressing a 
recombinant polypeptide. A protein preparation may include a mixture of protein 
molecules having different N-terminal amino acids, resulting from cleavage of the 

30 signal peptide at more than one site. 
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Suitable host cells for expression of polypeptides include prokaryotes, yeast or 
higher eukaryotic cells. Mammalian or insect cells are generally preferred for use as 
host cells. Appropriate cloning and expression vectors for use with bacterial, fungal, 
yeast, and mammalian cellular hosts are described, for example, in Pouwels et al. 
5 Cloning Vectors: A Laboratory Manual Elsevier, New York, ( 1 985). Cell-free 
translation systems could also be employed to produce polypeptides using RNAs 
derived from DNA constructs disclosed herein. 
Prokaryotic Systems 
Prokaryotes include gram-negative or gram-positive organisms. Suitable 
10 prokaryotic host cells for transformation include, for example, E. coli, Bacillus 
subtilis, Salmonella typhimurium, and various other species within the genera 
Pseudomonas, Streptomyces, and Staphylococcus. In a prokaryotic host cell, such as 
E. coli, a polypeptide may include an N-terminal methionine residue to facilitate 
expression of the recombinant polypeptide in the prokaryotic host cell. The N- 
15 terminal Met may be cleaved from the expressed recombinant polypeptide. 

Expression vectors for use in prokaryotic host cells generally comprise one or 
more phenotypic selectable marker genes. A phenotypic selectable marker gene is, 
for example, a gene encoding a protein that confers antibiotic resistance or that 
supplies an autotrophic requirement. Examples of useful expression vectors for 
20 prokaryotic host cells include those derived from commercially available plasmids 
such as the cloning vector pBR322 (ATCC 37017). pBR322 contains genes for 
ampicillin and tetracycline resistance and thus provides simple means for identifying 
transformed cells. An appropriate promoter and a DNA sequence are inserted into the 
pBR322 vector. Other commercially available vectors include, for example, pKK223- 
25 3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and pGEMl (Promega Biotec, 
Madison, WI, USA). 

Promoter sequences commonly used for recombinant prokaryotic host cell 
expression vectors include P-lactamase (penicillinase), lactose promoter system 
(Chang et al., Nature 275:615, 1978; and Goeddel et al., Nature 281:544, 1979), 
30 tryptophan (trp) promoter system (Goeddel et al., NucL Acids Res. 5:4057, 1980; and 
EP-A-36776) and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, 
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Cold Spring Harbor Laboratory, p. 412, 1982). A particularly useful prokaryotic host 
cell expression system employs a phage XP L promoter and a cIS57ts thermolabile 
repressor sequence. Piasmid vectors available from the American Type Culture 
Collection which incorporate derivatives of the XP L promoter include piasmid pHUB2 
(resident in E coli strain JMB9. ATCC 37092) and P PLc2S (resident in E coli RR1. 
ATCC 53082). 

Yeast Systems 

Alternatively, the polypeptides may be expressed in yeast host cells, 
preferably from the Saccharomyces genus (e.g., S. cerevisiae). Other genera of yeast, 
such as Pichia or Kluyveromyces. may also be employed. Yeast vectors will often 
contain an origin of replication sequence from a 2u yeast piasmid, an autonomously 
replicating sequence (ARS), a promoter region, sequences for polyadenylation, 
sequences for transcription termination, and a selectable marker gene. Suitable 
promoter sequences for yeast vectors include, among others, promoters for 
metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255:2073, 
1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149, 1968; and 
Holland et al., Biochem. 7 7:4900, 1978), such as enolase, glyceraldehyde-3 -phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phospho-glucose isomerase, and glucokinase. Other suitable vectors and 
promoters for use in yeast expression are further described in Hitzeman, EPA-73,657. 
Another alternative is the glucose-repressible ADH2 promoter described by Russell et 
al. (J. Biol. Chem. 258:261 '4, 1982) and Beier et al. (Nature 300:124, 1982). Shuttle 
vectors replicable in both yeast and E. coli may be constructed by inserting DNA 
sequences from pBR322 for selection and replication in E. coli (Amp' gene and 
origin of replication) into the above-described yeast vectors. 

The yeast a-factor leader sequence may be employed to direct secretion of the 
polypeptide. The a-factor leader sequence is often inserted between the promoter 
sequence and the structural gene sequence. See, e.g., Kurjan et al., Cell 30:933, 1982 
and Bitter et *U Proc. Natl. Acad. Sci. t/&4 ,57:5330, 1984. Other leader sequences 
suitable for facilitating secretion of recombinant polypeptides from yeast hosts are 
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known to those of skill in the art. A leader sequence may be modified near its 3' end 
to contain one or more restriction sites. This will facilitate fusion of the leader 
sequence to the structural gene. 

Yeast transformation protocols are known to those of skill in the art. One such 
5 protocol is described by Hinnen et al., Proc. Natl. Acad. ScL USA 75:1929, 197S. The 
Hinnen et al. protocol selects for Trp~ transformants in a selective medium, wherein 
the selective medium consists of 0.67% yeast nitrogen base, 0.5% casamino acids, 2% 
glucose, 10 mg/ml adenine and 20 mg/ml uracil. 

Yeast host cells transformed by vectors containing an ADH2 promoter 

10 sequence may be grown for inducing expression in a "rich" medium. An example of a 
rich medium is one consisting of 1% yeast extract, 2% peptone, and 1% glucose 
supplemented with 80 mg/ml adenine and 80 mg/ml uracil. Derepression of the 
ADH2 promoter occurs when glucose is exhausted from the medium. 
Mammalian or Insect Systems 

15 Mammalian or insect host cell culture systems also may be employed to 

express recombinant polypeptides. Baculo virus systems for production of 
heterologous proteins in insect cells are reviewed by Luckow and Summers, 
Bio/Technology 6:47 (1988). Established cell lines of mammalian origin also may be 
employed. Examples of suitable mammalian host cell lines include the COS-7 line of 

20 monkey kidney cells (ATCC CRL 1651) (Gluzman et ah, Cell 25:175, 1981), L cells, 
CI 27 cells, 3T3 cells (ATCC CCL 163), Chinese hamster ovary (CHO) cells, HeLa 
cells, and BHK (ATCC CRL 10) cell lines, and the CV1/EBNA cell line derived from 
the African green monkey kidney cell line CV1 (ATCC CCL 70) as described by 
McMahan et al. {EMBO J. 10: 2821, 1991). 

25 Established methods for introducing DNA into mammalian cells have been 

described (Kaufman, R.J., Large Scale Mammalian Cell Culture^ 1990, pp. 15-69). 
Additional protocols using commercially available reagents, such as Lipofectamine 
lipid reagent (Gibco/BRL) or Lipofectamine-Plus lipid reagent, can be used to 
transfect cells (Feigner et al., Proc. Natl Acad. ScL USA 54:7413-7417, 1987). In 

30 addition, electroporation can be used to transfect mammalian cells using conventional 
procedures, such as those in Sambrook et al. {Molecular Cloning: A Laboratory 
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Manual, 2 ed. Vol. 1-3, Cold Spring Harbor Laboratory Press, 1989). Selection of 
stable transformants can be performed using methods known in the art, such as, for 
example, resistance to cytotoxic drugs. Kaufman et al., Meth. in Enzvmology 
755:487-51 1, 1990, describes several selection schemes, such as dihydro folate 
5 reductase (DHFR) resistance. A suitable host strain for DHFR selection can be CHO 
strain DX-B1 I. which is deficient in DHFR (Urlaub and Chasin. Proc. Natl. Acad. 
ScL USA 77:4216-4220. 1980). A plasmid expressing the DHFR cDNA can be 
introduced into strain DX-BI 1, and only cells that contain the plasmid can grow in the 
appropriate selective media. Other examples of selectable markers that can be 
10 incorporated into an expression vector include cDNAs conferring resistance to 
antibiotics, such as G418 and hygromycin B. Cells harboring the vector can be 
selected on the basis of resistance to these compounds. 

Transcriptional and translational control sequences for mammalian host cell 
expression vectors can be excised from viral genomes. Commonly used promoter 
15 sequences and enhancer sequences are derived from polyoma virus, adenovirus 2, 
simian virus 40 (S V40), and human cytomegalovirus. DNA sequences derived from 
the SV40 viral genome, for example, SV40 origin, early and late promoter, enhancer, 
splice, and polyadenylation sites can be used to provide other genetic elements for 
expression of a structural gene sequence in a mammalian host cell. Viral early and 
20 late promoters are particularly useful because both are easily obtained from a viral 

genome as a fragment, which can also contain a viral origin of replication (Fiers et aL, 
Nature 273:\U 7 1978; Kaufinan, Meth. in Enzymology, 1990). Smaller or larger 
SV40 fragments can also be used, provided the approximately 250 bp sequence 
extending from the Hind III site toward the Bgl I site located in the S V40 viral origin 
25 of replication site is included. 

Additional control sequences shown to improve expression of heterologous 
genes from mammalian expression vectors include such elements as the expression 
augmenting sequence element (EASE) derived from CHO cells (Morris et al., Animal 
Cell Technology, 1997, pp. 529-534 and PCT Application WO 97/25420) and the 
30 tripartite leader (TPL) and VA gene RNAs from Adenovirus 2 (Gingeras et al., J. 
Biol Chem. 257:13475-13491, 1982). The internal ribosome entry site (IRES) 
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sequences of viral origin allows dicistronic mRNAs to be translated efficiently (Oh 
and Sarnow, Current Opinion in Genetics and Development 3:295-300. 1993; Ramesh 
et al. ? Nucleic Acids Research 24:2697-2700, 1996). Expression of a heterologous 
cDNA as part of a dicistronic mRNA followed by the gene for a selectable marker 
5 (e.g. DHFR) has been shown to improve transfectability of the host and expression of 
the heterologous cDNA (Kaufman, Meth. in Enzymology, 1990). Exemplary 
expression vectors that employ dicistronic mRNAs are pTR-DC/GFP described by 
Mosser et al. ? Biotechmques 22: 150-161, 1997, and p2A5I described by Morris et al., 
Animal Cell Technology, 1997, pp. 529-534. 
10 A useful high expression vector, pCAVNOT, has been described by Mosley et 

al., Cell 59:335-348, 1989. Other expression vectors for use in mammalian host cells 
can be constructed as disclosed by Okayama and Berg {Mol Cell Biol. 3:280, 1983). 
A useful system for stable high level expression of mammalian cDNAs in CI 27 
murine mammary epithelial cells can be constructed substantially as described by 
15 Cosman et al. {Mol Immunol 23:935, 1986). A useful high expression vector, 
PMLSV N1/N4, described by Cosman et al., Nature 372:768, 1984, has been 
deposited as ATCC 39890. Additional useful mammalian expression vectors are 
described in EP-A-0367566, and in WO 91/18982, incorporated by reference herein. 
In yet another alternative, the vectors can be derived from retroviruses. 
20 Another useful expression vector, pFLAG®, can be used. FLAG® technology 

is centered on the fusion of a low molecular weight (IkD), hydrophilic, FLAG® 
marker peptide to the N-terminus of a recombinant protein expressed by pFLAG® 
expression vectors. 

Regarding signal peptides that may be employed, the native signal peptide 
25 may be replaced by a heterologous signal peptide or leader sequence, if desired. The 
choice of signal peptide or leader may depend on factors such as the type of host cells 
in which the recombinant polypeptide is to be produced. To illustrate, examples of 
heterologous signal peptides that are functional in mammalian host cells include the 
signal sequence for interleukin-7 (IL-7) described in United States Patent 4,965,195; 
30 the signal sequence for interleukin-2 receptor described in Cosman et al., Nature 

312:768 (1984); the interleukin-4 receptor signal peptide described in EP 367,566; the 
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type I interleukin-1 receptor signal peptide described in U.S. Patent 4,968,607; and 
the type H interleukin-1 receptor signal peptide described in EP 460,846. 
Purification 

The invention also includes methods of isolating and purifying the 
5 polypeptides and fragments thereof 

Isolation and Purification 
The "isolated" polypeptides or fragments thereof encompassed by this 
invention are polypeptides or fragments that are found in an environment that is not 
identical to their natural environment. The "purified" polypeptides or fragments 
10 thereof encompassed by this invention are essentially free of association with other 
proteins or polypeptides, for example, as a purification product of recombinant 
expression systems such as those described above or as a purified product from a non- 
recombinant source such as naturally occurring cells and/or tissues. 

In one preferred embodiment, the purification of recombinant polypeptides or 
15 fragments can be accomplished using fusions of polypeptides or fragments of the 
invention to another polypeptide to aid in the purification of polypeptides or 
fragments of the invention. Such fusion partners can include the poly-His or other 
antigenic identification peptides described above as well as the Fc moieties described 
previously. 

20 With respect to any type of host cell, as is known to the skilled artisan, 

procedures for purifying a recombinant polypeptide or fragment will vary according 
to such factors as the type of host cells employed and whether or not the recombinant 
polypeptide or fragment is secreted into the culture medium. 

In general, the recombinant polypeptide or fragment can be isolated from the 

25 host cells if not secreted, or from the medium or supernatant if soluble and secreted, 
followed by one or more concentration, salting-out, ion exchange, hydrophobic 
interaction, affinity purification or size exclusion chromatography steps. As to 
specific ways to accomplish these steps, the culture medium first can be concentrated 
using a commercially available protein concentration filter, for example, an Amicon 

30 or Millipore Pellicon ultrafiltration unit. Following the concentration step, the 

concentrate can be applied to a purification matrix such as a gel filtration medium. 
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Alternatively, an anion exchange resin can be employed, for example, a matrix 
or substrate having pendant diethylaminoethyl (DEAE) groups. The matrices can be 
acrylamide, agarose, dextran. celluiose or other types commonly employed in protein 
purification. Alternatively, a cation exchange step can be employed. Suitable cation 
5 exchangers include various insoluble matrices comprising sulfopropyl or 

carboxvmethyl groups. In addition, a chromatofocusing step can be employed. 
Alternatively, a hydrophobic interaction chromatography step can be employed. 
Suitable matrices can be phenyl or octyl moieties bound to resins. In addition, affinity 
chromatography with a matrix which selectively binds the recombinant protein can be 
10 employed. Examples of such resins employed are lectin columns, dye columns, 

antibody columns, and metal-chelating columns. Finally, one or more reversed-phase 
high performance liquid chromatography (RP-HPLC) steps employing hydrophobic 
RP-HPLC media, (e.g., silica gel or polymer resin having pendant methyl, octyl, 
octyldecyl or other aliphatic groups) can be employed to further purify the 
15 polypeptides. Some or all of the foregoing purification steps, in various 

combinations, are well known and can be employed to provide an isolated and 
purified recombinant protein. 

It is also possible to utilize an affinity column comprising a polypeptide- 
binding protein of the invention, such as a monoclonal antibody generated against 
20 polypeptides of the invention, to affinity-purify expressed polypeptides. These 

polypeptides can be removed from an affinity column using conventional techniques, 
e.g., in a high salt elution buffer and then dialyzed into a lower salt buffer for use or 
by changing pH or other components depending on the affinity matrix utilized, or be 
competitively removed using the naturally occurring substrate of the affinity moiety, 
25 such as a polypeptide derived from the invention. 

In this aspect of the invention, polypeptide-binding proteins, such as the anti- 
polypeptide antibodies of the invention or other proteins that may interact with the 
polypeptide of the invention, can be bound to a solid phase support such as a column 
chromatography matrix or a similar substrate suitable for identifying, separating, or 
30 purifying cells that express polypeptides of the invention on their surface. Adherence 
of polypeptide-binding proteins of the invention to a solid phase contacting surface 
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can be accomplished by any means, for example, magnetic microspheres can be 
coated with these polypeptide-binding proteins and held in the incubation vessel 
through a magnetic field. Suspensions of eel i mixtures arc contacted with the solid 
phase that has such polypeptide-binding proteins thereon. Cells having polypeptides 
of the invention on their surface bind to the fixed polypeptide-binding protein and 
unbound cells then are washed away. This affinity-binding method is useful for 
purifying, screening, or separating such polypeptide-expressing cells from solution. 
Methods of releasing positively selected cells from the solid phase are known in the 
art and encompass, for example, the use of enzymes. Such enzymes are preferably 
non-toxic and non-injurious to the cells and are preferably directed to cleaving the 
cell-surface binding partner. 

Alternatively, mixtures of cells suspected of containing polypeptide- 
expressing cells of the invention first can be incubated with a biotinylated 
polypeptide-binding protein of the invention. Incubation periods are typically at least 
one hour in duration to ensure sufficient binding to polypeptides of the invention. 
The resulting mixture then is passed through a column packed with avidin-coated 
beads, whereby the high affinity of biotin for avidin provides the binding of the 
polypeptide-binding cells to the beads. Use of avidin-coated beads is known in the 
art. See Berenson, et al. J. Cell Biochem., 10D:239 (1986). Conventional methods 
are used to wash the unbound material from the column and to release bound cells 
from the column. 

The desired degree of purity depends on the intended use of the protein. A 
relatively high degree of purity is desired when the polypeptide is to be administered 
in vivo, for example. In such a case, the polypeptides are purified so that no protein 
bands corresponding to other proteins are detectable upon analysis by SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE). It will be recognized by one skilled 
in the pertinent field that multiple bands corresponding to the polypeptide may be 
visualized by SDS-PAGE, due to differential post-translational modification, 
processing, and the like. Most preferably, the polypeptide of the invention is purified 
to substantial homogeneity, as indicated by a single protein band upon analysis by 
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SDS-PAGE. The protein band may be visualized by silver staining, Coomassie blue 

staining, or (if the protein is radiolabeled) by autoradiography. 

USE OF GNX/sGNX NUCLEIC ACID OR OLIGONUCLEOTIDES 

In addition to being used to express polypeptides as described above, the 
5 nucleic acids of the invention, including DNA, and oligonucleotides thereof can be 
used: 

as probes to identify nucleic acid encoding proteins having the 
ability to regulate angiogenesis; 

as probes to identify nucleic acid encoding protein agonists and 
10 antagonists or sGNK and related signaling pathways; 

as single-stranded sense or antisense oligonucleotides, to inhibit 

expression of polypeptide encoded by the GNK or sGNK gene; 

to further elucidate and characterize the biological activities of GNK 

and sGNK; and 
15 for gene therapy. 

Probes to Identify DNA Sequences Encoding Proteins Related to the 
Regulation of Vascularization 

Among the uses of nucleic acids of the invention is the use of fragments as 
probes or primers. Such fragments generally comprise at least about 17 contiguous 
20 nucleotides of a DNA sequence. In other embodiments, a DNA fragment comprises 
at least 30, or at least 60, contiguous nucleotides of a DNA sequence. 

Because homologs of SEQ ID NO: 1 or SEQ ID NO: 3, from other 
mammalian species, are contemplated herein, probes based on the human DNA 
sequence of SEQ ID NO: 1 or SEQ ID NO: 3 may be used to screen cDNA libraries 
25 derived from other mammalian species, using conventional cross-species 
hybridization techniques. 

Using knowledge of the genetic code in combination with the amino acid 
sequences set forth above, sets of degenerate oligonucleotides can be prepared. Such 
oligonucleotides are useful as primers, e.g., in polymerase chain reactions (PCR), 
30 whereby DNA fragments are isolated and amplified. 
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Probes to Identi fy Specific Agonists and Antagonists of sGNK 
The present invention also provides methods of detecting agonists and 
antagonists or sGNK and the GNK-sGNfK complex. In one embodiment, the 
invention thus generally provides a method for identifying gene products that 
5 associate with sGNK comprising: (a) introducing nucleic acid sequences encoding a 
sGNK. or fragment thereof, into a first expression vector such that sGNK sequences 
are expressed as part of a fusion protein comprising a functionally incomplete first 
portion of a protein that is essential to the viability of a host cell; (b) introducing the 
nucleic acid sequences encoding a plurality of candidate gene products that interact or 
1 0 associate with sGNK into a second expression vector such that any candidate gene 
products are expressed as part of a fusion protein comprising a second functionally 
incomplete portion of the protein that is essential to the viability of the host cell; (c) 
introducing the first and second expression vectors into a host cell under suitable 
conditions and for a sufficient time so that host cell survival depends upon the 
15 reconstitution of both first and second functionally incomplete portions of the protein 
(that is essential to the viability of the host cell) into a functionally complete protein; 
and (d) identifying the nucleic acid sequences encoding the candidate gene products 
that associate with sGNK in the second expression vector. 

For example, the yeast two-hybrid system (Fields and Song, Nature 340:245, 
20 1989; U.S. Patent No. 5,283,173) can be used to detect interactions between sGNK 
and other proteins or between sGNK and selected compounds, or pools of 
compounds, that are suspected of increasing or decreasing the activity of sGNK or of 
otherwise employing sGNK to transduce a biological signal. Such interactions can be 
detected by screening for functional reconstitution of a yeast transcription factor. 
25 Briefly, the yeast two hybrid system was developed as a way to test whether 

two proteins associate or interact directly with each other and was then modified to 
serve as a method to "capture" candidate proteins that interact with a known protein 
of interest or "bait". The bait protein is expressed as a fusion protein with the DNA- 
binding domain of GAL4, a yeast transcription factor, in a specially designed yeast 
30 strain (Yl 90) containing reporter genes under GAL4 control. (Durfee et al., Genes & 
Devel. 7:555, 1993). GAL 4 is a modular yeast transcription factor with the DNA 
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binding domain confined to the N-terminal 147 residues while the transcriptional 
activation function resides entirely in the C-terminal 1 14 residues. Libraries used in 
the two-hybrid system have clones expressing GAL4 activation domain fusion 
proteins. The method detects the reconstitution of GAL4 function when two fusion 
5 proteins encode proteins that associate with each other, so that the DNA-binding 
domain fusion recruits the activation domain fusion into position at the GAL4 
promoter, leading to transcriptional activation of the GAL^-controlled reporter genes. 

The sGNK nucleic acid sequences disclosed herein can be cloned into a 
suitable vector carrying the DNA-binding domain of GAL4 and transformed into an 
10 appropriate yeast strain to produce yeast cells which express a GAL 4 DNA-binding 
domain/sGNK region fusion protein using methods well known in the art. Activation 
domain cDNA libraries can then be screened in appropriate vectors. A positive signal 
in such a two-hybrid assay can result from cDNA clones that encode proteins that 
specifically associate with sGNK, such as substrates or activators of sGNK. 
15 Knowledge of proteins that associate with sGNK can also permit searching for 
inhibitors of downstream signaling pathways. 

The functional interaction between sGNK and its associating proteins also 
permits screening for small molecules that interfere with the GNK/sGNK, 
sGNK/substrate, or sGNK/activator association and thereby inhibit signal 
20 transduction via the GNK- sGNK pathway. For example, the yeast two-hybrid system 
can be used to screen for signaling pathway inhibitors as follows. 

sGNK and its activator/inhibitor, or portions thereof responsible for their 
interaction, can be fused to the GAL4 DNA binding domain and GAL4 transcriptional 
activation domain, respectively, and introduced into a strain that depends on GAL4 
25 activation for growth on plates lacking histidine. Compounds that prevent growth can 
be screened in order to identify inhibitors of the GNK-sGNK pathway or sGNK 
biological activity. Alternatively, the screen can be modified so that sGNK/activator 
or sGNK/substrate interaction inhibits growth, so that inhibition of the interaction 
allows growth to occur. Another approach to in vitro screening for inhibition of 
30 sGNK biological activity would be to immobilize one of the components, such as 
sGNK, or portions thereof, in wells of a microti ter plate, and to couple an easily 
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detected indicator to the other component. An inhibitor of the interaction is identified 
by the absence of the detectable indicator in the well. 

A high throughput screening assay can also be utilized to identity compounds 
that inhibit sGNK activity. For example, natural product extracts, from plant and 
5 marine sources, as well as microbial fermentation broths, can be sources of kinase 
inhibitors and can be screened for potential sGNK antagonists. Other sources of 
sGNK antagonists include pre-existing or newly generated libraries of small orsanic 
molecules and pre-existing or newly generated combinatorial chemistry libraries. 
Identification of endogenous sGNK substrate(s), and mapping of their interactive 
1 0 site(s) to determine their specific recognition motif(s), can enable the development of 
peptide mimetic inhibitors. In addition, in vivo regulation of sGNK activity likely 
involves endogenous protein inhibitor(s), which can be identified using the assay(s) 
described herein. 

These assays also facilitate the identification of other molecules that interact 
1 5 with sGNK in a physiologically relevant manner, such as endogenous substrates, 

activators and the aforementioned natural protein inhibitors. Such molecules include, 
but are not limited to, receptors and receptor associated polypeptides, guanine 
nucleotide binding proteins (G proteins), GEFs, guanine nucleotide activating proteins 
(GAPs), transcription activators, and repressors. Additionally, the sGNK assays can 
20 serve as readouts to identify other enzymes involved in the signaling cascade, such as 
other kinases, phosphatases, and phospho lipases. 
Sense- Antisense 

Other useful fragments of the nucleic acids include antisense or sense 
oligonucleotides comprising a single-stranded nucleic acid sequence (either RNA or 

25 DNA) capable of binding to target mRNA (sense) or DNA (antisense) sequences. 
Antisense or sense oligonucleotides, according to the present invention, comprise a 
fragment of SEQ ID NO: 1 or SEQ ID NO: 3. Such a fragment generally comprises 
at least about 14 nucleotides, preferably from about 14 to about 30 nucleotides. The 
ability to deriVe an antisense or a sense oligonucleotide, based upon a cDNA sequence 

30 encoding a given protein is described in, for example, Stein and Cohen {Cancer Res. 
48:2659, 1988) and van der Krol et al. {Bio/Techniques 6:958, 1988). 
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Binding of antisense or sense oligonucleotides to target nucleic acid sequences 
results in the formation of duplexes that block or inhibit protein expression by one of 
several means, including enhanced degradation of the mRNA by RNAse FL inhibition 
of splicing, premature termination of transcription or translation, or by other means. 
5 .Antisense or sense oligonucleotides further comprise oligonucleotides having 
modified sugar-phosphodiester backbones (or other sugar linkages, such as those 
described in W09 1/06629) and wherein such sugar linkages are resistant to 
endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable 
in vivo (i.e., capable of resisting enzymatic degradation) but retain sequence 

10 specificity to be able to bind to target nucleotide sequences. 

Other examples of sense or antisense oligonucleotides include those 
oligonucleotides which are covalently linked to organic moieties, such as those 
described in WO 90/10448, and other moieties that increases affinity of the 
oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further 

15 still, intercalating agents, such as ellipticine, and alkylating agents or metal complexes 
may be attached to sense or antisense oligonucleotides to modify binding specificities 
of the antisense or sense oligonucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing 
the target nucleic acid sequence by any gene transfer method, including, for example, 

20 lipofection, CaP0 4 -mediated DNA transfection, electroporation, or by using gene 
transfer vectors such as Epstein-Barr virus. 

Sense or antisense oligonucleotides also may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a ligand 
binding molecule, as described in WO 91/04753. Suitable ligand binding molecules 

25 include, but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the 
ligand binding molecule does not substantially interfere with the ability of the ligand 
binding molecule to bind to its corresponding molecule or receptor, or block entry of 
the sense or antisense oligonucleotide or its conjugated version into the cell. 

30 Alternatively, a sense or an antisense oligonucleotide may be introduced into a 

cell containing the target nucleic acid sequence by formation of an oligonucleotide- 
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lipid complex, as described in WO 90/10448. The sense or antisense oligonucleotide- 
lipid complex is preferably dissociated within the cell by an endogenous lipase. 

Further elucidation and characterization of the biological activities of 
GNK and sGNK 

5 The materials and methods of the present invention may be used to prepare 

cells, embryos, fetuses, and animals that are heterozygous (+/-) or homozygous (-/-) 
for a GNK or sGNX targeted mutation. These cells, embryos, fetuses, and animals 
are useful for demonstrating the role of GNK and/or sGNK in vascularization and for 
demonstrating other biological activities of GNK and/or sGNK. The skilled artisan 

10 will realize that a variety of methods may be used to generate cells, embryos, fetuses, 
and animals with alterations in the expression of GNK and/or sGNK. These methods 
include generating targeted mutations and knockouts, attenuating gene expression 
using antisense, ribozyme, or small molecule technology, and attenuating or 
activating gene expression using a Zn finger approach such as that described by Segal 

15 et ah (Proc. Nat. Acad. Sci. USA 96(6):2758, 1999). For the latter approach, Zn 
fingers are targeted 5* or 3 f of the coding portion of GNK or sGNK, or within an 
intron, allowing the novel construction of a novel gene switch. 
Gene Therapy 

The invention also provides expression vectors useful in gene therapy 
20 applications. Appropriate expression vectors are readily constructed by those skilled 
in the art and may be used for gene therapy using retroviral vector constructs or may 
be developed and utilized with other viral constructs including, for example, 
poliovirus (Evans et al., Nature 339:385, 1989; Sabin, J. Biol. Standard. 1:115, 1973); 
rhinovirus; poxviruses, such as canary pox or vaccinia virus (Fisher-Hoch et aL, Proc. 
25 Natl. Acad. Sci. USA 86:317, 1989; Flexner et zUAnn. NY. Acad. Sci. 569:86, 1989; 
Flexner et aL, Vaccine 8:17, 1990; U.S. Patent Nos. 4,603,1 12 and 4,769,330; WO 
89/01973); polyoma viruses such as SV40 (Mulligan et al., Nature 277:108, 1979); 
influenza virus (Luytjes et aL, Cell 59:1 107, 1989; McMichael et aL, N. Eng. J. Med. 
309:13, 1983: Yap et aL, Nature 273:238, 1978); adenoviruses (Berkner, 
30 Biotechniques 6:616, 1988; Rosenfeld et aL, Science 252:43 1, 1991); parvoviruses 

such as adeno-associated virus (Samulski et al.,7. Virol 63:3822, 1989; Mendelson et 
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al., Virol. 166:154, 1988) and herpes viruses (Kit, Adv. Exp. Med. Biol. 215:219, 
1989). 

Once a vector has been prepared, it may be therapeutically administered by 
well known methods, for example, by direct administration, or via transfection 
5 utilizing physical methods, such as lipofection (Feigner et ah, Proc. Natl. Acad. Sci. 
USA 84:7413, 1989), direct DNA injection (Ascadi et al., Nature 352:815, 1991), 
microprojectile bombardment (Williams et al., Proc. Natl. Acad. Sci. USA 88:2726, 
1991), liposomes (Wang et al., Proc. Natl. Acad. Sci. USA S4:7S51, 1987), calcium 
phosphate (Dubensky et al., Proc. Natl Acad. Sci. USA 81:7529, 1984), or DNA 
10 ligand (Wu et al., J. Biol Chem. 264:16985, 1989). 

USE OF GNK AND/OR sGNK POLYPEPTIDES AND FRAGMENTED 
POLYPEPTIDES 

Uses include, but are not limited to, the following: 
Purification Reagents 
15 - Measuring Biological Activity 

Identification of Agonists or Antagonists of GNK or sGNK 
Identification of Unknown Proteins 
Antibodies 
Therapeutic Agents 

20 Purification Reagents 

The GNK or sGNK polypeptides of the invention find use as protein 
purification reagents. For example the GNK polypeptides may be attached to a solid 
support material and used to purify sGNK proteins by affinity chromatography, or 
vice versa, i.e., the sGNK is attached to a solid support and used to purify GNK. In 

25 particular embodiments, a sGNK polypeptide (in any form described herein that is 

capable of binding GNK), or vice versa, is attached to a solid support by conventional 
procedures. As one example, chromatography columns containing functional groups 
that will react with functional groups on amino acid side chains of proteins are 
available (Pharmacia Biotech, Inc., Piscataway, NJ). In an alternative, a 
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polypeptide/Fc protein (as discussed above) is attached to Protein A- or Protein G- 
containing chromatography columns through interaction with the Fc moiety. 

In addition to purification, such affinity columns can be used to select and 
isolate previously unidentified binding proteins, moieties, and/or co factors. 
5 Measuring Biological Activity 

GNK polypeptides also find use in measuring the biological activity of sGNK 
protein in terms of their binding affinity, and vice versa. The polypeptides thus may 
be employed by those conducting "quality assurance" studies, e.g.. to monitor shelf 
life and stability of protein under different conditions. For example, the GNK 
1 0 polypeptides may be employed in a binding affinity study to measure the biological 
activity of sGNK that has been stored at different temperatures, or produced in 
different cell types, or vice versa. The sGNK may also be used to determine whether 
biological activity is retained after modification of GNK (e.g., chemical modification, 
truncation, mutation, etc.), and vice versa. The binding affinity of the modified 
1 5 GNK/sGNK protein is compared to that of an unmodified GNK/sGNK protein to 

detect any adverse impact of the modifications on biological activity of GNK/sGNK. 
The biological activity of a GNK/sGNK protein thus can be ascertained before it is 
used in a research study of angiogenesis, for example. 

Identification of Ag onists and Antagonists of GNK or sGNK - 
20 The polypeptides of the present invention may also be used in a screening 

assay to identify compounds and small molecules which inhibit (antagonize) or 
enhance (agonize) activation of the polypeptides of the instant invention. Thus, for 
example, polypeptides of the invention may be used to identify antagonists and 
agonists from cells, cell-free preparations, chemical libraries, and natural product 
25 mixtures. The antagonists and agonists may be natural or modified substrates, 

ligands, enzymes, receptors, etc. of the polypeptides of the instant invention, or may 
be structural or functional mimetics of the polypeptides. Potential antagonists of the 
polypeptides of the instant invention may include small molecules, peptides, and 
antibodies that bind to and occupy a binding site of the polypeptides, causing them to 
30 be unavailable to bind to their ligands and therefore preventing normal biological 
activity. Other potential antagonists are antisense molecules which may hybridize to 
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mRNA in vivo and block translation of the mRNA into the polypeptides of the instant 
invention. Potential agonists include small molecules, peptides and antibodies which 
bind to the instant polypeptides and elicit the same or enhanced biological effects as 
those caused by the binding of the polypeptides of the instant invention. 
5 Small molecule agonists and antagonists are usually less than 1 OK molecular 

weight and may possess a number of physiochemical and pharmacological properties 
that enhance cell penetration, resist degradation and prolong their physiological 
half-lives. (Gibbs, J., Pharmaceutical Research in Molecular Oncology. Cell, Vol. 79 
(1994).) Antibodies, which include intact molecules as well as fragments such as Fab 
10 and F(ab')2 fragments, may be used to bind to and inhibit the polypeptides of the 
instant invention by blocking the commencement of a signaling cascade. It is 
preferable that the antibodies are humanized, and more preferable that the antibodies 
are human. The antibodies of the present invention may be prepared by any of a 
variety of well-known methods. 
15 Specific screening methods are known in the art and along with integrated 

robotic systems and collections of chemical compounds/natural products are 
extensively incorporated in high throughput screening so that large numbers of test 
compounds can be tested for antagonist or agonist activity within a short amount of 
time. These methods include homogeneous assay formats such as fluorescence 
20 resonance energy transfer, time resolved fluorescence resonance energy transfer, 
fluorescence polarization, scintillation proximity assays, reporter gene assays, 
fluorescence quenched enzyme substrate, chromogenic enzyme substrate and 
electrochemiluminesence, as well as, more traditional heterogeneous assay formats 
such as enzyme linked immunosorbent assays (ELISA) or radioimmunoassays. 
25 Homogeneous assays are mix and read style assays that are very amenable to robotic 
application, whereas heterogeneous assays require separation of free from bound 
analyte by more complex unit operations such as filtration, centrifiigation or washing. 
These assays are utilized to detect a wide variety of specific biomolecular interactions 
and the inhibition thereof by small organic molecules, including protein-protein, 
30 receptor-ligand, enzyme-substrate, etc. These assay methods and techniques are well 
known in the art (see, e.g., High Throughput Screening: The Discovery of Bioactive 
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Substances, John P. Devlin (ed.), Marcel Dekker, New York, 1997, ISBN: 0-8247- 

0067-8; http://www.lab-robotics.oru/: http://www.sbs0nline.0rg/V The screening 
assays of the present invention are amenable to high throughput screening of chemical 
libraries and are suitable for the identification of small molecule drug candidates, 
antibodies, peptides and other antagonists and/or agonists. 

One embodiment of a method for identifying molecules which antagonize or 
inhibit the polypeptides involves adding a candidate molecule to a medium which 
contains cells that express the polypeptides of the instant invention; changing the 
conditions of said medium so that, but for the presence of the candidate molecule, the 
polypeptides would be bound to their ligands; and observing the binding and 
stimulation or inhibition of a functional response. The activity of the cells which 
were contacted with the candidate molecule may then be compared with the identical 
cells which were not contacted and agonists and antagonists of the polypeptides of the 
instant invention may be identified. The measurement of biological activity may be 
performed by a number of well-known methods such as measuring the amount of 
protein present (e.g. an ELISA) or of the protein's activity. A decrease in biological 
stimulation or activation would indicate an antagonist. An increase would indicate an 
agonist. 

Screening assays can further be designed to find molecules that mimic the 
biological activity of the polypeptides of the instant invention. Molecules which 
mimic the biological activity of a polypeptide may be useful for enhancing the 
biological activity of the polypeptide. To identify compounds for therapeutically 
active agents that mimic the biological activity of a polypeptide, it must first be 
determined whether a candidate molecule binds to the polypeptide. A binding 
candidate molecule is then added to a biological assay to determine its biological 
effects. The biological effects of the candidate molecule are then compared to the 
those of the polypeptide. 

Identification of Unknown Proteins 

As set forth above, a polypeptide or peptide fingerprint can be entered into or 
compared to a database of known proteins to assist in the identification of the 
unknown protein using mass spectrometry (W.J. Henzel et al., Proc. Natl Acad. Set 
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USA 90:501 1-5015, 1993; D. Fenyo et al M Electrophoresis 19:998-1005, 1998). A 
variety of computer software programs to facilitate these comparisons are accessible 
via the Internet, such as Protein Prospector (Internet site: prospector.uscf.edu), ■ 
Multildent (Internet site: w\v\v.expasy.ch/'sprot/multiident.html), PeptideSearch 

5 (Internet site: www.mann.embl-heiedelberg.de... deSearch/FR_PeptideSearch 

Form.html), and ProFound (Internet site: www.chait-sgi.rockefeiler.edu/cgi-bin/prot- 
id-frag.html). These programs allow the user to specify the cleavage agent and the 
molecular weights of the fragmented peptides within a designated tolerance. The 
programs compare these molecular weights to protein databases to assist in 

10 determining the identity of the unknown protein. 

In addition, a polypeptide or peptide digest can be sequenced using tandem 
mass spectrometry (MS/MS) and the resulting sequence searched against databases 
(J.K. Eng, et aL, J. Am. Soc. Mass Spec. 5:976-989 (1994); M. Mann and M. Wilm, 
Anal Chem. 66:4390-4399 (1994); J.A. Taylor and R.S. Johnson, Rapid Comm. Mass 

15 Spec. 1 1 :1067-1075 (1997)). Searching programs that can be used in this process 
exist on the Internet, such as Lutefisk 97 (Internet site: 

www.lsbc.com:70/Lutefisk97.html), and the Protein Prospector, Peptide Search and 
ProFound programs described above. Therefore, adding the sequence of a gene and 
its predicted protein sequence and peptide fragments to a sequence database can aid in 
20 the identification of unknown proteins using tandem mass spectrometry. 
Antibodies 

Antibodies that are immunoreactive with the polypeptides of the invention are 
provided herein. Such antibodies specifically bind to the polypeptides via the 
antigen-binding sites of the antibody (as opposed to non-specific binding). Thus, the 

25 polypeptides, fragments, variants, fusion proteins, etc., as set forth above may be 
employed as "immunogens" in producing antibodies immunoreactive therewith. 
More specifically, the polypeptides, fragment, variants, fusion proteins, etc. contain 
antigenic determinants or epitopes that elicit the formation of antibodies. 
These antigenic determinants or epitopes can be either linear or 

30 conformational (discontinuous). Linear epitopes are composed of a single section of 
amino acids of the polypeptide, while conformational or discontinuous epitopes are 
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composed of amino acid sections from different regions of the polypeptide chain that 
are brought into close proximity upon protein folding (C. A. Janeway. Jr. and P. 
Travers. Immuno Biology* 3:9 (Garland Publishing Inc., 2nd ed. 1996)). Because 
folded proteins have complex surfaces, the number of epitopes available is quite 
5 numerous; however, due to the conformation of the protein and steric hinderances, the 
number of antibodies that actually bind to the epitopes is less than the number of 
available epitopes (C. A. Janeway, Jr. and P. Travers, Immuno Biology* 2:14 (Garland 
Publishing Inc., 2nd ed. 1996)). Epitopes may be identified by any of the methods 
known in the art. 

10 Thus, one aspect of the present invention relates to the antigenic epitopes of 

the polypeptides of the invention. Such epitopes are useful for raising antibodies, in 
particular monoclonal antibodies, as described in more detail below. Additionally, 
epitopes from the polypeptides of the invention can be used as research reagents, in 
assays, and to purify specific binding antibodies from substances such as polyclonal 

15 sera or supernatants from cultured hybridomas. Such epitopes or variants thereof can 
be produced using techniques well known in the art such as solid-phase synthesis, 
chemical or enzymatic cleavage of a polypeptide, or using recombinant DNA 
technology. 

As to the antibodies that can be elicited by the epitopes of the polypeptides of 
20 the invention, whether the epitopes have been isolated or remain part of the 
polypeptides, both polyclonal and monoclonal antibodies may be prepared by 
conventional techniques. See, for example, Monoclonal Antibodies, Hybridomas: A 
New Dimension in Biological Analyses, Kennet et al. (eds.), Plenum Press, New York 
(1980); and Antibodies: A Laboratory Manual, Harlow and Land (eds.), Cold Spring 
25 Harbor Laboratory Press, Cold Spring Harbor, NY, (1988). 

Hybridoma cell lines that produce monoclonal antibodies specific for the 
polypeptides of the invention are also contemplated herein. Such hybridomas may be 
produced and identified by conventional techniques. One method for producing such 
a hybridoma cell line comprises immunizing an animal with a polypeptide; harvesting 
30 spleen cells from the immunized animal; fusing said spleen cells to a myeloma cell 
line, thereby generating hybridoma cells; and identifying a hybridoma cell line that 
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produces a monoclonal antibody that binds the polypeptide. The monoclonal 

antibodies may be recovered by conventional techniques. 

The monoclonal antibodies of the present invention include chimeric 
antibodies, e.g., humanized versions of murine monoclonal antibodies. Such 
5 humanized antibodies may be prepared by known techniques and offer the advantage 
of reduced immunogenicity when the antibodies are administered to humans. In one 
embodiment, a humanized monoclonal antibody comprises the variable region of a 
murine antibody (or just the antigen binding site thereof) and a constant region 
derived from a human antibody. Alternatively, a humanized antibody fragment may 
10 comprise the antigen binding site of a murine monoclonal antibody and a variable 
region fragment (lacking the antigen-binding site) derived from a human antibody. 
Procedures for the production of chimeric and further engineered monoclonal 
antibodies include those described in Riechmann et al. (Nature 352:323, 1988), Liu et 
al. (PNAS 54:3439, 1987), Larrick et al. (Bio/Technology 7:934, 1989), and Winter 
15 and Harris {TIPS 14:139, May, 1993). 

Procedures that have been developed for generating human antibodies in 
non-human animals may be employed in producing antibodies of the present 
invention. The antibodies may be partially human or preferably completely human. 
For example, transgenic mice into which genetic material encoding one or more 
20 human immunoglobulin chains has been introduced may be employed. Such mice 
may be genetically altered in a variety of ways. The genetic manipulation may result 
in human immunoglobulin polypeptide chains replacing endogenous immunoglobulin 
chains in at least some, and preferably virtually all, antibodies produced by the animal 
upon immunization. 

25 Mice in which one or more endogenous immunoglobulin genes have been 

inactivated by various means have been prepared. Human immunoglobulin genes 
have been introduced into the mice to replace the inactivated mouse genes. 
Antibodies produced in the animals incorporate human immunoglobulin polypeptide 
chains encoded by the human genetic material introduced into the animal. 
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Examples of techniques for the production and use of such transgenic animals 

are described in U.S. Patent Nos. 5,814.3 18, 5,569.825. and 5.545.806. which are 
incorporated by reference herein. 

Antigen-binding fragments of the antibodies, which may be produced by 
5 conventional techniques, are also encompassed by the present invention. Examples of 
such fragments include, but are not limited to. Fab and F(ab') 2 fragments. Antibody- 
fragments and derivatives produced by genetic engineering techniques are also 
provided. 

In one embodiment, the antibodies are specific for the polypeptides of the 
10 present invention and do not cross-react with other proteins. Screening procedures by 
which such antibodies may be identified are well known, and may involve 
immunoaffinity chromatography, for example. 
Therapeutic Agents 

When used as a therapeutic agent, sGNK, a sGNK antagonist, or a sGNK 
1 5 agonist can be formulated into pharmaceutical compositions according to known 
methods, either individually, in combination, or combined with GNK, an GNK 
agonist, or an GNK agonist (either individually or in combinations). The sGNK, its 
antagonist, or agonist can be introduced into the intracellular environment using 
methods well known in the field, such as encasing sGNK in liposomes or coupling 
20 sGNK to a monoclonal antibody targeted to a specific cell type. 

The sGNK, a sGNK antagonist, or a sGNK agonist can be combined in 
admixture, either as the sole active material or with other known active materials, with 
pharmaceutically suitable diluents (e.g., Tris-HCl, acetate, or phosphate buffers), 
preservatives (e.g., Thimerosol, benzyl alcohol, parabens), emulsifiers, solubilizers, 
25 adjuvants and/or carriers. Suitable carriers and their formulations are described in 
Remington's Pharmaceutical Sciences, 16 th ed. 1980, Mack Publishing Co. In 
addition, such compositions can contain sGNK, its antagonist, or its agonist, 
complexed with polyethylene glycol (PEG), metal ions, or incorporated into 
polymeric compounds such as polyacetic acid, polyglycolic acid, hydrogels, etc., or 
30 incorporated into liposomes, microemulsions, micelles, unilamellar or multilamellar 



BNSDOCfD: <WO 0036097 A2 I > 



WO 00/36097 PCT/US99/29989 

• -4*7- . 

vesicles, erythrocyte ghosts or'spheroplasts. Such combinations will influence the 

physical state, solubility, stability, rate of in vivo release, and rate of in vivo clearance, 

The following examples are provided to further illustrate particular 
embodiments of the invention, and are not to be construed as limiting the scope of the 
5 present invention. 

EXAMPLE 1 
Purification of sGNK 

This example describes purification of sGNK from rabbit lungs. Lungs were 
isolated from seventy New Zealand white rabbits intravenously injected with 1 00 
10 fig/ks of human recombinant IL-Lce, fifteen minutes prior to sacrifice. Following 
sacrifice, lungs were rapidly removed, washed in conventional ice cold phosphate 
buffered saline (cold PBS), immediately fast frozen, and stored at -80°C. The lungs 
were homogenized using a Brinkman tissue homogenizes Tissue and cellular debris 
was removed by centrifugation and ultrafiltration. The resulting supernatant was 
15 made 25% with respect to ammonium sulphate and proteins precipitated by this 0- 

25% salt cut were collected by centrifugation. Pelleted proteins were resuspended and 
sequentially subjected to the following purification steps: (1) ion-exchange 
chromatography using Source 15 Q (Pharmacia); (2) dye affinity chromatography 
using Reactive Green 19 (Sigma Chemicals); (3) size exclusion chromatography using 
20 Superdex 200 (Pharmacia); (4) affinity chromatography using heparin-sepharose 

(Pharmacia); (5) ion-exchange chromatography using Mono Q resin (Pharmacia); (6) 
size exclusion chromatography using SEC-400 (BioRad); (7) ion-exchange 
chromatography using a microbore Mono Q column; and electrophoretic separation 
using SDS-PAGE with 8-16% polyacrylamide gradient gels (Novex). 
25 The final chromatographic step, fractionation on a microbore MonoQ column 

containing 35 |il resin, was performed to concentrate the sample in a small volume for 
electrophoresis (Fig. 9). Briefly, fractions containing both GNK and sGNK from the 
SEC-400 chromatography step, were loaded onto the Mono Q column, previously 
equilibrated in 20 mM Tris-HCl, pH 8.5, 10 mM p-glycerophosphate, 1 mM 
30 dithiothreitol (DTT), 1 mM EDTA, 1 mM EGTA, 1 mM phenylmethyl sulphonyl 
fluoride (PMSF), 0.1 mM leupeptin, 10% glycerol and 0.1% NP-40 (Buffer A), at a 
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flow rate of 50 ul/min. After loading, the column was washed with 10 column 
volumes of Buffer A. Bound proteins, which included GNK and sGNTC. were eluted 
using a 500 ul linear 0-500 mM N'aCl gradient in Buffer A. 

The fraction containing GNK and sGNK was subjected to SDS-PAGE on an 
5 8- 1 6% Tris-glycine gradient gel (Novex). Bands were visualized by silver staining. 
sGNK was identified and excised from the gel. Trypsin digestion was performed in 
situ and resulting peptides were extracted by methods known in the art. The isolated 
peptides were analyzed by mass spectroscopy. The amino acid sequences of several 
peptides were ascertained and these sequences were utilized to design oligonucleotide 
10 probes for use in the molecular cloning of sGNK. 

EXAMPLE 2 
Cloning and Sequencing of Human sGNK 
The amino acid sequence for sGNK from the rabbit HBCK preparation was 
obtained using protease digestion and mass spectrometry, using methods known in the 
1 5 art. Three rabbit peptide sequences were found that matched a publicly- available 
human genomic expressed sequence containing 71 amino acids (Genbank accession 
number Tl 1454). Two oligonucleotide primers were used to screen cDNA libraries to 
determine whether they contained the message encoding the 71 amino acid sequence. 
These primers, identified as primer 21497 and 21499, were: 5'- 
20 GCCTTTGGACAAGCACACAC-3' (SEQ ID NO: 1 1) and 5'- 

CTCCTTCAGCTCCTGGGCC-3 1 (SEQ ID NO: 12), respectively. Several human 
cDNA libraries were found to be positive, including Raji (B cell), Clone 22 (T cell), 
KB (epithelial cell), NK cell (NK cell), human dermal fibroblasts (HDF), and WI26 
(lung fibroblast). 

To generate a template for a probe, the AgtlO KB library was amplified using 
primers 21497 and 21499. Amplifying 7.5 nanograms (ng) of the template with 
primer 21499 resulted in an 200 base pair single-stranded antisense PCR probe. This 
probe was used to screen 400,000 plaques from a human KB library made in AgtlO. 
Four positive plaques were identified. 



25 
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One of the positive plaques, clone KB-9-2-1, was mapped by PCR using all 

combinations of one vector and one insert primer. The vector primer sequences, 
referred to as U30 and D30, were: 

5'-CGAGCTGCTCTATAGACTGCTGGGTAGTCC-3' (SEQ ID NO: 13) and 
5 5*-TAACAGAGGTGGCTTATGAGTATTTCTTCC-3' (SEQ ID NO: 14), 
respectively. Primers 21497 and 21499 were used as insert primers. 

Sequencing of the PCR products revealed that product 2-2 (produced using 
primers U30 and 21499) extended the sequence in the 5 [ direction and that product 2-5 
(produced using primers D30 and 21497) extended the sequence in the 3' direction. A 

10 probe was made from PCR product 2-5 by digesting with EcoRI and amplification 
with primer 21498, 5 , -AAACCACAAGAAGGTGGCTG-3* (SEQ ID NO: 15). This 
probe was used to screen 500,000 plaques from a Raji cDNA library in A.gtl0. Four 
positive clones were picked and sequenced using conventional sequencing 
procedures. A Raji probe from one of these clones (Raji9-9-l A) was generated by 

15 amplifying with primers 23206 and 23207, S'-AGGTGAAGCGGCTGTCCCACGA- 
3' (SEQ ID NO: 16) and 

S'-CTCCTTCAGCTCCTGGGCCACAo' (SEQ ID NO: 17), respectively. 
Amplifying 24 ng of clone Raji9-9-l A using primer 23207 generated an antisense 35 
bp single-stranded probe. This probe was used to screen 500,000 plaques from a HDF 
20 library made in XgtlO. Three positive clones were identified and sequenced. 

The entire sGNK open reading frame is present in a composite of the four Raji 
clones and the three HDF clones. Further, Raji9-9-lA was found to be a full length 
clone. This clone was used as the template to make expression constructs. 

EXAMPLE 3 

25 Kinase assay demonstrating autophosphorylation 

of GNK and phoshorylation of sGNK by GNK 

GNK autophosphorylation was demonstrated for both purified rabbit and 
human recombinant GNK by incubating GNK in the presence of y- 32 P-ATP and Mn 2+ . 
Kinase assays were performed in kinase buffer (20 mM HEPES, pH 7.4, 10 mM 
30 MnCl 2 25^M cold ATP and 1 [id gamma - 32 P-ATP), at 30°C for 20 minutes. The 
reactions were stopped by adding conventional SDS electrophoresis buffer followed 
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by incubation of the mixture at 100°C for 3 minutes. Reaction products were 
separated by electrophoresis on 8-16% gradient gels. These gels were silver stained, 
dried, then exposed to storage phosphor screens which were analyzed using a 
Molecular Dynamics Phosphoimager. 
5 GNK-mediated phosphorylation of sGNK. was demonstrated using the kinase 

assay described above, except purified, recombinant sGNK was added to the kinase 
reaction mixture. Electrophoretic and Phosphoimager analysis of the reaction 
products were performed as described 

EXAMPLE 4 

1 0 Isolation of a genomic clone encoding a portion of the 

murine GNK gene and construction of an GNK gene targeting vector 

A lambda genomic library prepared from 129 DNA (Stratagene, La Jolla, CA) 
was screened with a human GNK cDNA. The insert from a hybridizing phage was 
subcloned as a Not I restriction fragment into pGEM 1 1 and mapped by a 

15 combination of sequencing, restriction mapping, and PCR analyses using primers 

based on the human GNK cDNA sequence. A homologous recombination vector was 
constructed in which an EcoRI fragment containing what we assume to be exon 1 was 
replaced with a PGK-neo cassette. The 5* end of the targeting vector extends to an 
Asp718 site 5' of exon 1 and the 3* end of the targeting vector extends to an EcoRI 

20 site 3* of what we assume to be exon 2. Additionally, an MC-TK cassette was 
subcloned into the 3" end of the vector. The backbone of the targeting vector is 
pGEM 1 1 . The PGK-neo and MC-TK cassettes are standard cassettes that confer, 
respectively, resistance to G418 and sensitivity to ganciclovir (see Fig. 10). 

EXAMPLE 5 

25 Generation of embryonic stem (ES) cell 

clones heterozygous for a targeted mutation in GNK 

The GNK homologous recombination vector was electroporated into 129 
derived embryonic stem (ES) cells maintained on irradiated primary embryonic 
fibroblast feeder layers in LIF containing media using standard techniques. 
30 Transfected cells were selected for 9-14 days in media containing 175 (ag/ml G418 
and 2 jiM ganciclovir using standard techniques. Resistant clones were expanded and 
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analyzed for the presence of a targeted GNK allele by PCR using the primers 

5 , -CCGGTGGATGTGGAATGTGTG-3 ! (SEQ ID NO: 5) and 

5'-CAAAGCCAAGGTTCATTCGGTG-3' (SEQ ID NO: 6) using equimolar 

concentrations of each primer. Colonies yielding the 1.4 kb PCR product expected 

5 for a targeted mutation in the GNK gene (and confirmed using the positive control 

vector; see Fig. 10) were expanded and used to generate chimeric mice. Additionally, 

genomic southern analyses using BamHI digested genomic DNA and a 1 .4 kb 

Asp718-Not I probe isolated from the positive control vector were used to confirm 

targeted disruption of the GNK gene; the wild type allele yields a 8.5 kb BamHI 

10 hybridizing band and the disrupted allele yields a 6 kb band. 

EXAMPLE 6 

Generation of chimeric mice using ES cells 
heterozygous for a targeted mutation in GNK 

ES cell clones heterozygous for a targeted mutation in GNK were used to 
15 generate chimeras by blastocyst injection of day 3.5 C57BL/6 blastocysts, followed 
by transfer of injected blastocysts into day 2.5 pseudopregnant Swiss- Webster 
recipients using standard techniques. The resulting male chimeras were bred to 
C57BL/6 females and germline transmission events were determined by a 
combination of coat color analyses and PCR analyses of ear punch biopsies using 
20 standard techniques. The primers used for these four primer PCR analyses (equimolar 
concentration of each primer) are: 

5'-GCCCTGAATGAACTGCAGGACG-3' (SEQ ID NO: 7) 
5'-CACGGGTAGCCAACGCTATGTC-3' (SEQ ID NO:8) 
5'-CTTCCGCTTCCACGACACTCG-3' (SEQ ID NO: 9) 
25 5 T -CTCAATGGCCTCAGACGCCAG-3' (SEQ ID NO: 10) 

The wild type GNK allele yields a 170 bp PCR product and the mutant GNK 
allele yields a 520 bp PCR product. Mice heterozygous for the targeted mutation in 
GNK yield both PCR products. 
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EXAMPLE 7 
Generation of GNK-deficient fetuses and fibroblasts 

Mice heterozygous for the targeted mutation in GNFC (GNK+/-) were 
intercrossed. Fetuses were obtained at various developmental stages and genotyped 
5 using the four primers described in Example 6. Fetuses homozygous for the GNK 
mutation (GNK-/-) yielded only the 520 bp PGR product. Fibroblasts homozygous 
for the GNK mutation were derived from embryonic day 1 1.5-13.5 (el 1.5-el3.5) 
fetuses obtained from GNK+/- intercrosses, genotyped as described above and 
cultured using standard techniques. 
10 EXAMPLE 8 

Oligomeriztion of GNK 
Human recombinant GNK (hu-rGNK), permitted to autophosphorylate as 
described above, for various times (t = 0, 5, 15, 30, 45, 60, 90, 120, 150, or 180 
minutes) at 30 °C, then analyzed by SDS-PAGE, revealed time dependent formation 
15 of high molecular weight aggregates of GNK. Based on co-electrophoresed molecular 
weight standards, a single silver-stained, radioactive, aggregate band migrated in the 
gel to a distance consistent with a moiety of - 350 kDa, suggesting a trimeric GNK 
complex. Higher molecular weight aggregates were also detected. 

Purified human recombinant GNK also behaves like a molecule of -300-350 
20 kDa on size exclusion chromatography (gel filtration) when compared to calibration 
standards run under identical conditions. Briefly, hu-rGNK loaded onto a Superdex 
200 column previously equilibrated with 20 mM HEPES, pH 7.4, and run at a flow 
rate of 2.0 ml/min elutes from the column with an apparent molecular weight of 300- 
350 kDa, again consistent with a trimeric complex. When the same Superdex 200 
25 analysis was performed on autophosphorylated hu-rGNK, GNK eluted from the 
column as higher order oligomers (>500 kD) as well as at 300-350 kDa. 

EXAMPLE 9 
Monoclonal Antibodies to sGNK 
This example illustrates the preparation of anti-sGNK monoclonal antibodies. 
30 sGNK is expressed in mammalian host cells such as COS-7 or 129 cells and purified 
by techniques generally known in the art. Purified or partially purified sGNK can be 



BNSDOCID: <WO 0036097 A2 I > 



WO 00/36097 



PCT/US99/29989 



- 53 - 

used to generate monoclonal antibodies against sGNK using conventional techniques, 
such as those described in U.S. Pat. No. 4.41K993. 

Briefly, BALB'c mice are immunized with sGNK emulsified in complete 
Freund's adjuvant and injected subcutaneously or intraperitoneally in amounts 
5 ranging from 10-100 ug. Ten or twelve days later, the immunized animals are 

boosted with additional sGNK emulsified in incomplete Freund's adjuvant. Mice are 
boosted thereafter on a weekly to biweekly immunization schedule. Serum samples 
are periodically taken by retro-orbital plexus bleeding or tail tip excision for testing 
for sGNK antibodies using conventional dot blot assay or ELISA. 
10 Following detection of an appropriate antibody titer positive animals are 

provided one last intravenous injection of sGNK in saline. Three to four days later, 
the animals are sacrificed, and spleen cells are harvested and fused to a murine 
myeloma cell line, e.g., NS1 or preferably P3x63Ag8.653 (ATCC CRL 1580). 
Fusions generate hybridoma cells, which are plated in multiple microtiter in a HAT 
15 (hypoxanthine, aminopterin, thymidine) selective medium to inhibit proliferation of 
non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells are screened by ELISA for reactivity against purified 
sGNK by adaptations disclosed in Engvall (Immunochem. 8:871, 1971) and in U.S. 
Pat. No. 4,700,004. A preferred screening technique is the antibody capture 
20 technique. (Beckmann et al., J. Immunol. 144:4212, 1990). Positive hybridoma cells 
can be injected intraperitoneally into syngeneic BALB/c mice to produce ascites 
fluids containing high concentrations of anti-sGNK monoclonal antibodies. 
Alternatively, hybridoma cells can be grown in vitro in flasks or roller bottles by 
various techniques. Monoclonal antibodies produced in mouse ascites fluids can be 
25 purified by ammonium sulfate precipitation, followed by gel exclusion 

chromatography. Alternatively, affinity chromatography based on binding of 
antibody to protein A or protein G can be used, as can affinity chromatography based 
on binding to sGNK. 
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EXAMPLE 10 

Identifying Novel Genes and/or Gene Products 
Involved in Regulating Vascularization 

Novel genes and gene products involved in regulating vascularization will be 
5 identified by comparing the gene expression or protein modification patterns in wild- 
type cells, organs ; or animals ("wt cells"), i.e., those expressing GNK and sGNK, with 
GNK- and/or sGNK-deficient cells , organs, or animals ("deficient 06115"). 

For example, ES cells (GNK +/+) and ES null cells (GNK -/-) will be 
propagated in vitro in parallel cultures, i.e., under identical culture conditions. After a 

10 predetermined period of time, the cells will be harvested by scraping, trypsinization, 
or other methods known in the art. The harvested cells will be pelleted by low speed 
centrifugation and the cell culture media decanted. 

The harvested cell pellets can be washed, if desired, to remove residual serum 
or media components and again pelleted by low speed centrifugation. The cell pellets 

15 will then be disrupted or lysed using either chemical methods (such as detergents or 
enzymes, for example) or physical methods such as sonication, French press or high 
shear forces, such as rapid pipetting through small bore orifices like pipette tips. 
These lysed cell preparations may be fractionated by various centrifugation or 
fractionation procedures known in the art to obtain for example, fractions enriched for 

20 cellular membranes, intracellular organelles such as nuclei or mitochondria, higher- 
ordered oligomeric complexes, cytosolic components, or the like. The skilled artisan 
will recognize that such fractions may be further purified or fractionated using, for 
example, additional centrifugation, chromatographic, electrophoretic, or other 
fractionation techniques. 

25 Once parallel fractions of the desired purity are obtained, these samples will be 

analyzed using conventional analytical techniques, such as electrophoresis or 
chromatography, and profiles generated. Comparison of wt cell profiles with 
deficient cell profiles will allow the identification of differentially expressed genes 
and/or modified proteins. For example, the radiolabeled phosphoprotein profiles 

30 visualized by autoradiography of 2-dimensional polyacrylamide gels will be different 
between parallel extracts from cultures of wt and GNK-deficient cells propagated in 
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media containing y- 32 P-ATP, as sGNK will be radiolabeled in the wt cell extracts, but 
not the GNK-deficient cell extracts. Similarly, differential gene expression will be 
observed by the absence or enhancement of a particular polypeptide in one sample, 
but not the other. 
5 EXAMPLE 1 1 

Vascular Defects in GNK Deficient Mice 
GNK deficient (GNK-/-) fetuses were obtained by crossing mice that were 
heterozygous for the GNK targeted mutation (GNK+/-), as described in Example 7. 
GNK deficiency resulted in lethality at approximately embryonic day 1 1.5 (el 1.5). At 

10 a gross level, GNK deficient fetuses and yolk sacs appeared undervascularized. To 
examine the role of GNK in vascular development and function in more detail, the 
TIE2-lacZ transgene (Schlaeger TM et al (1997) Proc. Natl. Acad. Sci. 94:3058-3063) 
was moved onto the GNK deficient background. The lacZ transgene is expressed 
only in endothelial cells and thus provides a means of easily visualizing blood vessel 

15 structures during development. 

Vascular structures in TIE2-lacZ transgenic tissues that were either GNK 
deficient (GNK-/-) or GNK sufficient were visualized histochemically by staining for 
P-galactosidase activity essentially as described in Hogan et al. (Manipulating the 
mouse embryo: A laboratory manual, CSH Press, 1994). As shown in Figure 11, yolk 

20 sacs derived from GNK-/-TIE2-lacZ+ fetuses at el 0.5 (Figure 1 IB) lacked the 

organized vascular structures readily evident in el 0.5 GNK sufficient TIE2-lacZ+ 
yolk sacs (Figure 1 1 A). 

Similar methods are used to analyze vascular structures in the yolk sac and the 
embryo proper at e9.5 and el 1 .5. To study the role of GNK in the development of 

25 functional endothelial cells, chimeric analyses are performed using the rosa26 system 
(Zambrowicz BP et al (1997) Proc. Natl. Acad. Sci. 94: 3789-3794). Briefly, GNK-/- 
ES cells are injected into rosa26 blastocysts. At various embryonic stages in adults, 
organs derived from the resulting chimeras are stained for P-galactosidase activity to 
determine which lineages are dependent upon GNK for development. In situ 

30 hybridization for GNK further demonstrates the temporal and spatial expression of 
GNK during development. 
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It will be apparent to those skilled in the art that various modifications and 
variations can be made in the disclosed methods and compositions without departing 
from the scope or spirit of the invention. Other embodiments of the invention will be 
apparent to those skilled in the art from consideration of the specification and practice 
5 of the invention disclosed herein. It is intended that the specification and examples be 
considered as exemplary only, with a true scope and spirit of the invention being 
indicated by the following claims. 



3NSDOCID: <WO 0036097A2 I > 



WO 00/36097 



PCT/US99/29989 



-57- 

WHAT IS CLAIMED IS : 

1 . An isolated nucleic acid molecule encoding sGNK or a variant thereof. 

2. An isolated nucleic acid molecule of claim 1 , comprising a sequence 
selected from the group consisting of (a) the sequence of nucleotides in SEQ [D NO: 
1, from nucleotide I to nucleotide 4610; (b) nucleic acid molecules capable of 
hybridization to a nucleic acid molecule of (a) under conditions of moderate 
stringency, and which encode sGNK; and (c) nucleic acid molecules which are 
degenerate, as a result of the genetic code, with respect to the nucleic acid molecules 
of (a) or (b). 

3. An isolated nucleic acid molecule of claim 1, comprising a sequence 
selected from the group consisting of (a) the sequence of nucleotides in SEQ ID NO: 
1, from nucleotide 75 to nucleotide 2549; (b) nucleic acid molecules capable of 
hybridization to a nucleic acid molecule of (a) under conditions of moderate 
stringency, and which encode sGNK; and (c) nucleic acid molecules which are 
degenerate, as a result of the genetic code, with respect to the nucleic acid molecules 
of (a) or (b). 

4. A recombinant expression vector comprising a promoter operably 
linked to a nucleic acid molecule according to claim 1, 2, or 3. 

5. A host cell containing the recombinant expression vector of claim 4. 

6. An isolated sGNK polypeptide. 

7. An isolated sGNK polypeptide that is encoded by the nucleic acid of 
one of claims 1-3. 

8. An isolated sGNK polypeptide comprising an amino acid sequence 
that is at least 80% identical to SEQ ID NO: 2. 

9. The sGNK polypeptide of claim 8 comprising the sequence SEQ ID 

NO: 2. 

10. An isolated GNK comprising a sufficient number of amino acids from 
SEQ ID NO: 4 to confer on the polypeptide vascularization regulatory activity on 
cells of mammalian origin. 
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11. An isolated nucleic acid molecule comprising a sufficient number of 

nucleotides from SEQ ID NO: 3 to encode a polypeptide that regulates 
vascularization. 

12. A recombinant expression vector comprising a promoter operably 
linked to the nucleic acid sequence of claim 1 1 . 

13. A host cell containing the recombinant expression vector of claim 12. 

1 4. A method for producing a polypeptide having vascularization 
regulating activity comprising culturing the recombinant host cell according to claim 
13 under suitable conditions to express the polypeptide encoded by the nucleic acid 
molecule. 

15. The method of claim 14 wherein the host cell is a bacterial yeast, 
insect or mammalian cell. 

16. The method of claim 14 wherein the host cell is a COS cell. 

17. The method of claim 14 wherein the host cell is a 293 cell. 

1 8. A method for stimulating blood vessel development comprising 
administering a therapeutically effective amount of GNK or GNK agonist to a patient. 

1 9. The method of claim 1 8 wherein the stimulation of blood vessel 
development promotes wound healing. 

20. The method of claim 18 wherein the stimulation of blood vessel 
development reduces cardiac dysfunction. 

21. A method of reducing blood vessel development comprising 
administering a therapeutically effective amount of a GNK antogonist to a patient. 

22. The method of claim 21 wherein the reduction of blood vessel 
development slows or prevents tumor development. 

23. ^ A method for treating a vascularization disorder comprising the step of 
administering to a patient a therapeutically effective amount of a GNK antagonist. 

24. The method of claim 23 wherein the vascularization disorder is 
psoriasis. 

25. The method of claim 23 wherein the vascularization disorder is 
arthritis. 
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26. The method of claim 23 wherein the vascularization disorder is 
proliferative retinopathy. 

27. A pharmaceutical composition comprising GNK. 

2S. A pharmaceutical composition comprising at least one GNK agonist. 

29. A pharmaceutical composition comprising at least one GNK 
antagonist. 

30. An antibody molecule, or fragment thereof, which binds specifically to 
the vascularization regulatory domain of GNK. 

31. An antibody molecule, or fragment thereof, which binds specifically to 
a portion of the sGNK of one of claims 6-9. 

32. The antibody molecule, or fragment thereof, of claim 31 which is a 
polyclonal antibody. 

33. The antibody molecule, or fragment thereof, of claim 31 which is a 

monoclonal antibody. 

34. A method for substantially purifying sGNK comprising: 
coupling the antibody molecule, or fragment thereof of any of claims 30 to 

33, to a solid support to form a matrix that selectively binds sGNK; 

applying a mixture of polypeptides containing the sGNK to the matrix; 

introducing conditions under which the sGNK and the antibody molecule, or 
fragment thereof, form an antigen-antibody complex; 

washing unbound components of the mixture of polypeptides from the matrix; 

and 

eluting the sGNK from the matrix. 

35. A homologous recombination vector comprising a nucleotide sequence 
substantially similar to SEQ ID NO: 3, the sequence differing from SEQ ID NO: 3 by 
the addition, deletion, or substitution of one or more nucleotides to prevent expression 
of a polypeptide with vascularization regulatory capability, structurally linked to one 
or more selectable marker genes. 

36. The homologous recombination vector of claim 35 wherein at least one 
selectable marker gene confers resistance to G418. 
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37. The homologous recombination vector of claim 35 wherein at least one 
selectable marker gene confers sensitivity to ganciclovir. 

38. A method for generating GNK-deficient ceils comprising: 
transfecting cells with a homologous recombination vector that is incapable of 

expressing biologically active GNK and that contains a selectable marker; 
selecting for transfected cells using selective medium; 
propagating the transfected cells in culture; and 
monitoring the propagated cells for GNK expression. 

39. A method for identifying genes or gene products involved in regulating 
vascularization comprising: 

propagating GNK-deficient cells or organs in parallel with wild-type cells or 

organs; 

preparing parallel samples from the GNK-deficient and wild-type cells for 
analysis; 

comparing the parallel GNK-deficient and wild-type samples for differential 
gene expression or protein modification; and 

identifying the differentially expressed gene or differentially modified protein. 

40. A method for identifying genes or gene products involved in regulating 
vascularization comprising: 

propagating sGNK-deficient cells or organs in parallel with wild-type cells or 

organs; 

preparing parallel samples from the sGNK-deficient and wild-type cells for 
analysis; 

comparing the parallel sGNK-deficient and wild-type samples for differential 
gene expression or protein modification; and 

identifying the differentially expressed gene or differentially modified protein. 

41 . A method of identifying a compound that modulates a protein-protein 
interaction between GNK and sGNK, comprising: 

a) contacting a candidate compound with GNK and sGNK under conditions 
permitting the interaction between GNK and sGNK, and 
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b) measuring the ability of the candidate compound to modulate the 
interaction between GNK and sGNK. 

42. A method of identifying a compound that modulates phosphorylation 
of sGNK by GNK, comprising: 

a) contacting a candidate compound with GNK and sGNK under conditions 
permitting the phosphorylation of sGNK by GNK, and 

b) measuring the ability of the candidate compound to modulate the 
phosphorylation of sGNK by GNK. 

43. A method of identifying a compound that modulates vascularization 

comprising: 

a) contacting a candidate compound with GNK or sGNK, and 

b) measuring the ability of the candidate compound to modulate a biological 
activity of the GNK or sGNK. 

44. A compound identifiable by a method according to one of claims 

41-43. 

45. The compound of claim 44 wherein the compound is selected from the 
group consisting of activators of GNK, inhibitors of GNK, activators of sGNK, 
inhibitors of sGNK ? activators of an interaction between GNK and sGNK, inhibitors 
of the interaction between GNK and sGNK, activators of phosphorylation of sGNK 
by GNK, and inhibitors of phosphorylation of sGNK by GNK. 

46. A nonhuman transgenic embryo, fetus, or animal that is heterozygous 
for a GNK targeted mutation. 

47. A nonhuman GNK-deficient embryo or fetus produced by crossing 
heterozygous animals according to claim 46. 

48. A cell from the embryo, fetus, or animal of claim 46 or 47. 

49. A GNK deficient cell. 

50. An sGNK deficient cell. 

51. A recombinant vector comprising a promoter operably linked to a 
nucleic acid molecule encoding sGNK according to claim 8. 

52. A host cell containing the recombinant expression vector of claim 5 1 . 
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CAGCGTGGAC 


AGCAGGTGAG 


3701 


TCTTCATTGA 


GGGAAAGAAA 


3751 


TGATTCTTTA 


CTTGCAAATT 


3801 


AACCTTGAGT 


GTGCTTTGGG 


3851 


CAAGCTGTAT 


GAACGCCACG 


3901 


CGACGCTGTG 


TTGGCATGCG 


3951 


CTTTGCAAAT 


GTGAAAGCGA 


4001 


TAATCACTGT 


TAATTTCAGG 


4051 


CCAGGTCTAC 


ATCCTGTTTT 


4101 


AAATTCGTAC 


TCTTGGAGGA 


4151 


GAAAAATTAT 


AATAATATAA 


4201 


GAATAAGCAT 


TTCTTATTCC 


4251 


GTTACAGTGT 


TGATTCATAA 


4301 


CTGTATGGGG 


TATTTATTTG 


4351 


TGCATTACCT 


TAGTGTCAGA 


4401 


TGTTCTCCTG 


CCAAGGAACC 


4451 


TCAAATGCCT 


TCAATTTCTG 


4501 


AGGAAGGAGA 


AATTAATTGA 


4551 


TGCCTACAAA 


GCCAGTGTGC 


4601 


CATTGGTAGG 





GG TTTTTC AT ATTG T ATT AT TATTATTCTT 
AGAAGAAAAC TTGACCTTTG CTCTGGTTAA 
AAAAAAAGAT AAATTATTGA ATGAGTAGTA 
TTTCTGGGTT TTAGGACGTT GTGAAGCATG 
ACAACTGTAC CAATAAAATT CCAAATTGGA 
GTTGTGCCAA ATTGTGGTAC ACTTAGAAAA 
AGGGTGTTCT CTTTCAACAC CTTTTTGTTA 
CTTCATCAGT TAAGGGAGGT GTCCCAGCAC 
CAGGGAAGAG C TAG TOGO C A TGCTGAAGGC 
GCAGGTGCTC CTCACACCCA GACCTGGGCA 
ACAGTCATTG TGCAAAATTC TGTTAGTCAG 
CAGGGGCTTA GAAAATGAAA GCAAACACAA 
AACCAAATGG ACCTTCTGGG ACAAGCTGAG 
TTTGTGAAGA GCTGAGGGTA TCAGGAGGGC 
CAGTAGGGGA TGAGGGTTAG C CAT AGTATT 
GACATTATAT CTTCTCTTGC TTGGTGTAAC 
AAACAGAACT CATTAAAACT CCTTAGCAAA 
GTTTGCTGAG TGAGGTTAGT GGGAGTGGTC 
AGAAAAACTG TCCTTCCTTC TCCAAAAAAG 
ATGACAAAAA TAAAAGAATT CTGTTTCCTG 
TAGTTGTAGG GACTCCTATT TTTACCTTCC 
GAAATATTGT TACATTTGAG ATAACTTCAT 
CAATGATGTC TGAGTACTGT ATTTTTTCTG 
ATGTTGGTCT TTATTTTAAA GTCATATGCA 
TTTACACAGA CCCAAACAAA AAAATAATAA 
AGAAAATGAG GCAGAGCATG GAAAAGGAAT 
GATTTTCAGG ACACAGACAT ATGATGTGAA 
ATAGGAACAG TGGGCCTGGG TAAAGAGTCA 



BNSDOCID: <WO 0036097A2 f > 



WO 00/36097 4 /15 PCT/US99/29989 



1 MSAPSESEY ARLVMEAQFE WLRAEVKKLS KHLAHTTTREK IQAAEVGLAV 

101 ESLIyESASK 3QYWRKVLE LEJFELKQLSN VL1*7IT^SENE RLASV^XJELK 

151 ZZM^VEZQR OUi^EDIKEY :<rEEAHLLQD YSELESNXS LQK^SVLRQ 

201 NQV^jrtiLKH EZXRLEEETE YLMSQL3IAI RLKHISEPCL HHALETUCTE 

251 RagKNSLHKE LSKYMSUCS rYTSHLKVSL COJ<FSnDAA EPWE&EALV 

301 KKsELftPFSP SLVSELLSEL NZSEZQKLKQ 

351 QLM3<EP£KA GT J ATLQLJTQ KQTJEHTFGSL S32QH0/THL THOLSALRRL 

401 QASKER7TAL ENEXERDSHE DGDYYEVDIN GPEILPCKYH VAVBEKELR 

451 EQLraLSSIK ZAREAQHAEE KCS^EA33QA LTEXVSLLEK ASRQCRELLA 

501 RLEKELKKVS DV?£2TQGSL SV^ELVTF SEZLANLYKH VCM^KETJW 

551 RVMLDYYREG QGG^CTOSFG HSFILLPFEL LAFE¥3*AEG 

601 CTGDSSPSPG SSLPSFLSDP HRESWUVNL IAIXREQXKH IjQAAVEKTTE 

651 LSPQRIASQE LGPAVEKEKE ALMFFTTKLK SLLSIXRBQI TTLRIVLiCftN 

701 KCEAEVAI^N LKSKYENEKA MS/rOTMCLil NELKALKBCA ATFSSLRAMF 

751 AFTHKHTTLMS ALTQKLHXE 

801 LEHS^IKFGl AKAAFKIKPA TPSL* 



FIGURE 2 
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1 ATOIU3Groc TZZGCOZGrA CTOCTPOC TGCGAITCCA TCAACTCGGA 



101 CC^CTC^GGG GCCGCGAGCC 03GC3G?GC^ GGrOGP^Arro 

151 cacEAcaroc aaatuuuur arimxrac ccaxxrncG gggapgcqc 

201 GC^CT?cca: cgcacctmg Aro?oic?cr ojv . u i u - ui aposapstcg 

251 JCTTCAGOOG C^CTCIUAG APOGA^It: GIGftlOXTV gaatcp^att 

301 GITATiCTGG C^CTGCIXXA GCAIT^AAC ATESTCCCr PCT2CAA0TA 

351 CTr^ATOGAC AA230CAO3C TCCTGAXIGA GCTOGAAXAT TCTEAATO3SG 

401 G3AACCTOIA TCACAAAA2C criUJlt^ a. MG?CA2GIT Gl ' l'll^ AA 

451 GSCTlwjiULi TCTOGraOCT Ul ' l ' lCA XM TGMCK3CAT 

501 CCAIAAAG CT aGMUU-TlTJ AXAGMAIAT AA^VSCATIA AAIAXITITC 

551 TGSOCAM3C AAAOCTCMA AAAL.T1ULK3 A' l T A ' UjJUC T SOZAA2GAAA 

601 CTEAATICIG AJiAi'iUJAT GXT^SCG CTTCT3QGAA aXCAXATm 

651 OVIGTICICCA GMCICTOIC AMGSGEAAA GTESCAA2TIC Afl GVEUKJA m 

701 TCXOJJL^Gr ATITTTGAAC TOCITOCCIT AAMMGA03 

751 TITGATOCTA CAAAOQZRCT TAACTIGnCT GIGAAGAIOG TOCA2GGAAT 

801 TO333CCATC GMCTTCSCT CTMCCMIA CICITTOGAA TIGATCCAAA 

851 TOLsTlUATlC GIGCCTTCAC CMGATOCTC A3C3>G?GMC TSCTOCSCOT 

901 GMCITCTAG ATOXCCTCT TC T Cft QGAAA 03C3OGSfflG A2ATO3MGA 

951 AAAACT3CT CIT3CTTAATC CSOCT?CAAA GSGMCAMG TCAMC2CIG 
1001 MamXTOCT GTSGMCAT CPOGMCCM TGAAGICEAT 

1051 GTITO33G7IG GTOGAAAATC CTOGCTXCTG AAACTGGATC TTMCAMfiG 

1101 T03^TCraGT GCD03QCMG TCTCTT3CTOG GAA2MCCAC ITIUJlUlUi 

1151 TC3^3?iaGA GAMGAWTTC TCAPCA1QCA 

1201 AAflCTOCATC GICK3CIQ3G CXATO3CPC AAMCCTOCT ATOGROOOC 

FIGURE 3 
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1251 AAPOCATCTG Z^AXJTIGC TATOIAICAG G I U I U X XXJI X Z 

1301 G?rararrr qctcictct gtcpcigatg a33gtcmct cta i ujlti c 

1351 GGACC?G\TT ATT TCTXXTTC CATGGGGGTC G?OA?OTIG 

14C1 ACTSCTSGSA CCSOTCTCC TCAPCITCIT OC?GroGSGC 

1451 MGTItZTCCTG TGGr^VTAAT CATCTOGTOS TRTIGrCMG 

1501 GICiAi'iLTr CGrAIMOGA QGA CIXXio 'lT T3GATTQ^3V 

1551 PCAGGATraT TSmC?iOaC AAA^SJTO GA TCITCCCAPC UJL ' l ' liajL ' IA 

1601 i'lUi'iU^T CAITICICIT 

1651 G3C^A?CT3C: TG3CCTX7IX3G ^CICAMGAA TICAA1?A3C . TO XrlUIGRA 

1701 TCSOITXaSC TZAPO^ATCA GSA Jl ' lUJL T 

1751 PCPCMOoIC CrriPOCTTG GCCAAr^TT 'llJlU.Ti.TlA TAMATOCCT 

1801 Mcaa'iuuoc cpcogapgsc tcscscmct acmrTCTro ^aDGratxxx; 

1851 ui'iajiuAoc rnarrxA acmgtctos qc^ jl ' Iujlc oTiumA flcr 

1901 PC?>?GA?GCG TCTQQoAAIC raa ^ XTCCT 

1951 CAAjIUAITJA OTGTIGMGmG TITOOCAITG CTOCTSCKA 

2001 TCATAATCSC AlTiTHJUL T CXOX^ATOG OJUL ' IUJLA A 

2051 GAI03CTCT3 ATAITTIGTAC CICAIOXCT 

2101 CCTCCTAITT TTT3GAICICT GCAICATCTC CG3G?iJCTGT Cl'lCXJLXJIGG 

2151 'i'l^GAA^GT ATTGAATICT 

2201 GITCTAESG CB^IUULTIA TOSOTOGAA CTOIGITTCA G MC ' X C a QC 

2251 acqaGSflGgc (smaros azroGrracr gamammg pcpoxcpgcp^ 

2301 QGAATCIGAA ACICCTCACC CAP£JIT3GMG CTIOOGMGA 

2351 CMPOCIMS AAIQGAMuTr CAia33GMC 

2401 i^CTAAiaaas ccmcmcic croiariax: tjjltiujaa mgmctoga 

2451 TTTATOOXA CCCAIUITXT 

2501 aSITTICMA ATCKMAAA CCTATCMCA QCIQCMGGA 
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2701 
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1 MSVIJGEYEFH CBSTNSIFGS E933MDSS? GPSASQGFRA G3GAAH3EEL. 

101 VTLALIfiKEN I IAYj^MiFMD >nn r T.TTT-sy CMSGNLCTC li^TKLFEE 

151 EMWWYLFQI VSAVSCIKKA GXLHRDXJCTL MZFUH^MLX KLGDVGLAKK 

201 LMSEY2-JAZT L\aTFV n 2<5S? EIjCQGV/TQKF KSDr/JAMTV IFELLTIiOT 

251 FCMNFLNLC VIC^CZ^AM EVESSQYSLZ LIQC^KSCLD QDFBQRFEAD 

301 ELLCRFLLHK RRREMEEX^T LJ^IAPIXRF?. SST/TEAPXA WISPTSEW 

351 V/JG3C5C3TPQ KLCV32C3GCS ARQS-C?£NIH FAWIVSKEL YIVfl/TMQQGT 

401 KLH3QLGH3D KASYRQFKr?/ EKLQGKAIH2 VSJLil*'l\C Vll H JfJL^F 

451 GSDTCO>GV E2<V?£3 5 EVLE FM^LNFFLSM FVS3VSOXN HVWLTRNKE 

501 VVSWGC3GEVG RLC3XSESCY YTP^CVEVFK ALIIVA\QCS OXJIFLLTQS 

551 GKVLA2GLNE F!^KLGI2<X3CM EVFYTTSFTL AK3QLSFYKIR 

601 TTAPGKTHEA. AxDEPGKI-LT P3CMKC3QIJC3 U^YKKRIiGI NLLOCE1QGK 

651 QVIRVSCESX FIXSAmTJH XrPWGN32^G RLAMTFTERP H3SDICISWP 

701 FPIFGSLHHV FELSCPG^KT KTCBSNSSGL S1GIVFQSSS 

751 PG33Q33333 EESDSQQESE TPDP9QGErEG TMEACPQ^HG LISFIEM1XJ 

801 SN3ASS3CPG wrg ypr^ g FIPMPDSPSP LSAAFSESEX E7TLPYEEUQG 

851 tXVASEAFLE HKPCVEASSP KGTFLTPPAC PCSSL&JEHE 

901 RLQGLVIiCL AEQCKLQCEN LQIFTQU^KL NKKTfTTQg/ GMSfciii^lA 

951 KEEMEM3OT IXDSDSWCLL GIDSCRPSL* 



FIGURE 4 
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Putative Dcn^ s*,.,^ 
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51 r^™"^ ^rmvfTvP TO, waf -^^.^^ v^p^f^ 

101 tt.w,^ ^ rTTT ^ n ^.^ Tf TT TT -^ 

201 u*™™™ Mrrrnvvrtsp Fm^r t^t™^, Trrrr nT 
251 ^rrw ^<r^- r T ^ n -rr rmrmn 

301 PTTT-PPTTp^ 

351 WQ^STT^ VTr^rrvYpg A p^ri ^ nv ^firry 

401 K L HTTTfTTD V7V.^vp OP ^ / p^r^^p , r ^ rpryag 

451 

501 ^^^^ aLC:roi 



551 ^ri 

601 SiaEgJOHIA K3CNKD33LG U^KKRLGT NLU3GPL0GK 

651 CV1RVSCTE PITaftTODH IK^D^C RLAMTPTCSP HSSniCISWP 
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sGNK is a substrate for GNK in vitro. 
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FIGURE 8 
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Final GNK purification step: 
microbore Mono Q column chromatography 




silver autokinase 
stain activity 



FIGURE 9 
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SEQUENCE LISTING 

<1 10> Bird, Timothy A. 

Peschon, Jacques J. 
Sims, John E. 
Virca. G. Duke 
Willis, Cynthia R. 

<120> Methods for Regulating Vascularization Using GEF 
Containing NEK-Like Kinase (GNK) 

<130> Immunex GNK/sGNK PCT 

<140> Not Yet Assigned 
<141> 1999-12-17 

<150> 60/113,003 
<151> 1998-12-18 

<160> 19 

<170> Patentln Ver. 2.0 

<210> 1 
<211>4610 
<212> DNA 
<213> Homo sapiens 

<400> 1 

gcggcagcgg cggcggctga ggagggcccg gcctgcgaga gcctcagtgg gagccggctc 60 
agccctcggc caccatgtcg gcgccgtcgg aggaggagga gtacgcgcgg ctggtgatgg 120 
aggcgcagcc ggagtggctg cgcgccgagg tgaagcggct gtcccacgag ctggccgaga 180 
ccacgcgtga gaagatccag gcggccgagt acgggctggc ggtgctcgag gagaagcacc 240 
agctcaagct gcagttcgag gagctcgagg tggactatga ggctatccgc agcgagatgg 300 
agcagctcaa ggaggccttt ggacaagcac acacaaacca caagaaggtg gctgctgacg 360 
gagagagccg ggaggagagc ctgatccagg agtcggcctc caaggagcag tactacgtgc 420 
ggaaggtgct agagctgcag acggagctga agcagttgcg caatgtcctc accaacacgc 480 
agtcggagaa tgagcgcctg gcctctgtgg cccaggagct gaaggagatc aaccagaatg 540 
tggagatcca gcgtggccgc ctgcgggatg acatcaagga gtacaaattc cgggaagctc 600 
gtctgctgca ggactactcg gaactggagg aggagaacat cagcctgcag aagcaagtgt 660 
ctgtgctcag acagaaccag gtggagtttg agggcctcaa gcatgagatc aagcgtctgg 720 
aggaggagac cgagtacctc aacagccagc tggaggatgc catccgcctc aaggagatct 780 
cagagcggca gctggaggag gcgctggaga ccctgaagac ggagcgcgaa cagaagaaca 840 
gcctgcgcaa ggagctgtca cactacatga gcatcaatga ctccttctac accagccacc 900 
tgcatgtctc gctggatggc ctcaagttca gtgacgatgc tgccgagccc aacaacgatg 960 
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ccgaggccct ggtcaatggc tttgagcacg gcggcctggc caagctgcca ctggacaaca 1020 
agacctccac gcccaagaag gagggcctcg caccgccctc ccccagcctc gtctccgacc 1080 
tactcagtga gctcaacatc tctgagatcc agaagctgaa gcagcagctg atgcagatgg 1 140 
agcgggaaaa ggcgggcctg ctggcaacgc tgcaggacac acaeaaecae ctggagcaca POO 
cgcggggctc cctgtcagaa cagcaggaga aggtgaccctj cctcacagaeaatct^to 1260 
ccctgcggcg cctgcaggcc agcaaggagc ggcagacagc cctagacaac aayaaseacc 13*) 
gtgacagcca tgaggatggg gactactacg aggtggacat caacsascct gagatctw° 1380 
cctgcaagta ccatgtggct gtggctgagg ctggcgagct ccgcgagcag ctcaaggcac 1440 
tgcgcagcac gcacgaggct cgtgaggccc agcacgccga ggagaagggc cgctatgagg 1500 
ctgagggcca ggcactcacg gagaaggtct ccctgctaga gaaggccagc cgccaggacc 1560 
gcgagctgct ggcccggctg gagaaggagc taaagaaggt gagcgacstc gccggcgaaa 1620 
cacagggcag cctgagtgtg gcccaggatg agctggtgac cttcastgag gagctgocca~l 680 
atctctacca ccacgtgtgc atgtgcaaca atgagacacc caaccgtstc atgctggact 1740 
actaccgcga gggccagggc ggggccggcc gcaccagtcc cgggggccgc accagcccce 1800 
aggcgcgtgg ccggcgctca cccatcctcc tacccaaggg gctgctssct cctgaggcga I860 
gccgagcaga tggtgggacg ggggacagca gcccctcgcc tggctcctca ctgccatcac 1920 
ccctgagtga cccacgccgg gagcccatga acatctacaa ccteatcgct atcatccgtg 1980 
accagatcaa gcacctgcag gcagccgtgg accgcaccac ggagctgtca cgccagcgca ?040 
ttgcctctca ggagctgggc cccgccgtgg acaaggacaa ggaagcgctt atggaggaga 2100 
tcctcaagct gaagtcgctg ctcagcacca agcgggagca gatcaccacg ctgcgcactg 2 1 60 
tgctcaaggc caacaagcag acggccgagg tggcccttgc caacctgaag agcaagtatg 2220 
agaatgagaa ggccatggtt accgagacca tgatgaagct gcgcaatgag ctcaaggccc 2280 
tcaaggagga cgcagccacc ttctcctcgc tgcgtgctat gtttgccacc aggtgtgacg ?340 
agtacattac acagctggat gagatgcagc ggcagctggc ggctgctgag gacgagaaga 2400 
agacgctgaa ctcgctgctg cgcatggcca tccagcagaa gctggcgctg acccagcggc 2460 
tggagctgct cgagctggac catgagcaga cccggcgtgg ccgtgccaaa gccgccccga 2520 
agaccaagcc agccacaccg agcctgtaga gtagctgcca ggaggacttg gccacccggc 2580 
cctgtcacac tgcagcccct tccccttccc tctcgtggcc cacaaggagg aaggaagggc 2640 
aacctaaaag cccacttaga aactttttgg atatgccact gcaattcttt tcaaaatagc 2700 
attccccagg tttttaatgg gaggaaaaaa agctttaatg ttgagcatgc tgcgagctgc 2760 
tgcgtggaaa ggcctctgta tgggccgaag acccttcttc cctggctgcc aggctcgcca 2820 
ggagcccact ggaaacgccc accacggggg ctccttgtta cacatgttct ttttttatcc 2880 
gatcaacctg tgcacttttg atattttgat attatatttg cttccttaat tcctcgcgta 2940 
gagacggtct caggtgccgt ggtctatgct cgtggtcctg tagctgtccg cctcagctcc 3000 
caccgtgttt gtctggtgtc agcacgaggc agagctgtgt gctccatagc gtgtagcttt 3060 
agactcggag atgagtgctt tgacccagcg aggagctcag ctaagtgtat ccacgctgtg 3120 
gttcagcagc ctttagatca tacggcattg tggttcatgt ttgaaattac agattttaaa 3 1 80 
tgccatgttc attaagaaat ccagggtatt cagattctgg ggtttttcat attgtattat 3240 
tattattctt aggaatagtt caatgtaaca agaagaaaac ttgacctttg ctctggttaa 3300 
aacagtaata ggcacttgaa aaaaaaagat aaattattga atgagtagta ttacctacaa 3360 
attccagaat tttctgggtt ttaggacgtt gtgaagcatg actgattaac agaattttat 3420 
acaactgtac caataaaatt ccaaattgga attgttttgt tactctggtt gttgtgccaa 3480 
attgtggtac acttagaaaa ttctacagtc gtcgattttt agggtgttct ctttcaacac 3540 
ctttttgtta gtaatcattg ccagtagtgc cttcatcagt taaggg aggt gtcccagcac 3600 
agatcattct caaaagcgag cagggaagag ctagtgggca tgctgaaggc cagcgtggac 3660 
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agcaggtgag gcaggtgctc ctcacaccca gacctgggca tcttcattga gggaaagaaa 3720 
acagtcattg tgcaaaattc tgttagtcag tgattcttta cttgcaaatt caggggctta 3780 
gaaaatgaaa gcaaacacaa aaccttgagt gtgctttggg aaccaaatgg accttctggg 3840 
acaagctgag caagctgtat gaacgccacg tttgtgaaga gctgagggta tcaggagggc 3900 
cgacgctgtg ttggcatgcg cagtagggga tgagggttag ccatagtatt ctttgcaaat 3960 
gtgaaagcga gacattatat cctctcttgc ttggtgtaac taatcactgt taatttcagg 4020 
aaacagaact cattaaaact ccttagcaaa ccaggtctac atcctgtttt gtttgctgag 4080 
tgaggttagt gggagtggtc aaattggtac tcttggagga agaaaaactg tccttccttc 4140 
tccaaaaaag gaaaaattat aataatataa atgacaaaaa taaaagaatt ctgtttcctg 4200 
gaataagcat ttcttattcc tagttgtagg gactcctatt tttaccttcc gttacagtgt 4260 
tgattcataa gaaatattgt tacatttgag ataacttcat ctgtatgggg tatttatttg 4320 
caatgatgtc tgagtactgt attttttctg tgcattacct tagtgtcaga atgttggtct 4380 
ttattttaaa gtcatatgca tgttctcctg ccaaggaacc tttacacaga cccaaacaaa 4440 
aaaataataa tcaaatgcct tcaatttctg agaaaatgag gcagagcatg gaaaaggaat 4500 
aggaaggaga aattaattga gattttcagg acacagacat atgatgtgaa tgcctacaaa 4560 
gccagtgtgc ataggaacag tgggcctggg taaagagtca cattggtagg 4610 

<210>2 
<211>824 
<212> PRT 
<213> Homo sapiens 

<400> 2 

Met Ser Ala Pro Ser Glu Glu Glu Glu Tyr Ala Arg Leu Val Met Glu 
15 10 15 

Ala Gin Pro Glu Tip Leu Arg Ala Glu Val Lys Arg Leu Ser His Glu 

20 25. 30 

Leu Ala Glu Thr Thr Arg Glu Lys He Gin Ala Ala Glu Tyr Gly Leu 
35 40 45 

Ala Val Leu Glu Glu Lys His Gin Leu Lys Leu Gin Phe Glu Glu Leu 
50 55 60 

Glu Val Asp Tyr Glu Ala He Arg Ser Glu Met Glu Gin Leu Lys Glu 
65 70 75 80 

Ala Phe Gly Gin Ala His Thr Asn His Lys Lys Val Ala Ala Asp Gly 
85 90 95 

Glu Ser Arg Glu Glu Ser Leu lie Gin Glu Ser Ala Ser Lys Glu Gin 
100 105 110 

Tyr Tyr Val Arg Lys Val Leu Glu Leu Gin Thr Glu Leu Lys Gin Leu 
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115 120 125 

Arg Asn Val Leu Thr Asn Thr Gin Ser Glu Asn Glu Arg Leu Ala Ser 
130 135 140 

Val Ala Gin Glu Leu Lys G'.u lie Asn Gin Asn Val Glu He Gin Ar« 
145 150 155 160 " ° 

Gly Arg Leu Arg Asp Asp He Lys Glu Tyr Lvs Phe Arg Glu Ala Ar<* 
165 170 175 

Leu Leu Gin Asp Tyr Ser Glu Leu Glu Glu Glu Asn He Ser Leu Gin 
180 185 190 

Lys Gin Val Ser Val Leu Arg Gin Asn Gin Val Glu Phe Glu Glv Leu 
195 200 205 

Lys His Glu He Lys Arg Leu Glu Glu Glu Thr Glu Tyr Leu Asn Ser 
210 215 220 

Gin Leu Glu Asp Ala He Arg Leu Lys Glu He Ser Glu Are Gin Leu 
225 230 235 240 

Glu Glu Ala Leu Glu Thr Leu Lys Thr Glu Arg Glu Gin Lys Asn Ser 
245 250 255 

Leu Arg Lys Glu Leu Ser His Tyr Met Ser He Asn Asp Ser Phe Tyr 
260 265 270 

Thr Ser His Leu His Val Ser Leu Asp Gly Leu Lys Phe Ser Asp Asp 
275 280 285 

Ala Ala Glu Pro Asn Asn Asp Ala Glu Ala Leu Val Asn Gly Phe Glu 
290 295 300 

His Gly Gly Leu Ala Lys Leu Pro Leu Asp Asn Lys Thr Ser Thr Pro 
305 310 315 320 

Lys Lys Glu Gly Leu Ala Pro Pro Ser Pro Ser Leu Val Ser Asp Leu 
325 330 335 

Leu Ser Glu Leu Asn lie Ser Glu He Gin Lys Leu Lys Gin Gin Leu 
340 345 350 

Met Gin Met Glu Arg Glu Lys Ala Gly Leu Leu Ala Thr Leu Gin Asp 
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355 360 365 

Thr Gin Lys Gin Leu Glu His Thr Arg Gly Ser Leu Ser Glu Gin Gin 

370 375 380 

Glu Lys Val Thr Arg Leu Thr Glu Asn Leu Ser Ala Leu Arg Arg Leu 
385 390 " 395 400 

Gin Ala Ser Lys Glu Arg Gin Thr Ala Leu Asp Asn Glu Lys Asp Arg 
405 410 415 

Asp Ser His Glu Asp Gly Asp Tyr Tyr Glu Val Asp He Asn Gly Pro 
420 425 430 

Glu He Leu Ala Cys Lys Tyr His Val Ala Val Ala Glu Ala Gly Glu 
435 440 445 

Leu Arg Glu Gin Leu Lys Ala Leu Arg Ser Thr His Glu Ala Arg Glu 
450 455 460 

Ala Gin His Ala Glu Glu Lys Gly Arg Tyr Glu Ala Glu Gly Gin Ala 
465 470 475 480 

Leu Thr Glu Lys Val Ser Leu Leu Glu Lys Ala Ser Arg Gin Asp Arg 
485 490 495 

Glu Leu Leu Ala Arg Leu Glu Lys Glu Leu Lys Lys Val Ser Asp Val 
500 505 510 

Ala Gly Glu Thr Gin Gly Ser Leu Ser Val Ala Gin Asp Glu Leu Val 
515 520 525 

Thr Phe Ser Glu Glu Leu Ala Asn Leu Tyr His His Val Cys Met Cys 
530 535 540 

Asn Asn Glu Thr Pro Asn Arg Val Met Leu Asp Tyr Tyr Arg Glu Gly 
545 550 555 560 

Gin Gly Gly Ala Gly Arg Thr Ser Pro Gly Gly Arg Thr Ser Pro Glu 
565 570 575 

Ala Arg Gly Arg Arg Ser Pro He Leu Leu Pro Lys Gly Leu Leu Ala 
580 585 590 

Pro Glu Ala Gly Arg Ala Asp Gly Gly Thr Gly Asp Ser Ser Pro Ser 
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595 600 605 

Pro Gly Ser Ser Leu Pro Ser Pro Leu Ser Asp Pro Arg Arg GIu Pro 
610 615 620 



Met Asn lie Tyr Asp. Leu He Ala He He Arg Asp Gin lie Lys His 
625 630 635 640 

Leu Gin Ala Ala Val Asp Arg Thr Thr GIu Leu Ser Arg Gin Arg lie 
645 650 655 

Ala Ser Gin GIu Leu Gly Pro Ala Val Asp Lys Asp Lys GIu Ala Leu 
660 665 670 

Met GIu GIu He Leu Lys Leu Lys Ser Leu Leu Ser Thr Lvs Arg GIu 
675 680 685 

Gin He Thr Thr Leu Arg Thr Val Leu Lys Ala Asn Lys Gin Thr Ala 
690 695 700 

GIu Val Ala Leu Ala Asn Leu Lys Ser Lys Tyr GIu Asn GIu Lys Ala 
705 710 715 720 

Met Val Thr GIu Thr Met Met Lys Leu Arg Asn GIu Leu Lys Ala Leu 
725 730 735 

Lys GIu Asp Ala Ala Thr Phe Ser Ser Leu Arg Ala Met Phe Ala Thr 
740 745 750 

Arg Cys Asp GIu Tyr He Thr Gin Leu Asp GIu Met Gin Arg Gin Leu 
755 760 765 

Ala Ala Ala Glu Asp Glu Lys Lys Thr Leu Asn Ser Leu Leu Arg Met 
770 775 780 

Ala He Gin Gin Lys Leu Ala Leu Thr Gin Arg Leu Glu Leu Leu Glu 
785 790 795 800 

Leu Asp His Glu Gin Thr Arg Arg Gly Arg Ala Lys Ala Ala Pro Lys 
805 810 815 

Thr Lys Pro Ala Thr Pro Ser Leu 
820 
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<210>3 
<211>2940 
<212> DNA 
<213> Homo sapiens 

<400> 3 

atgtcggtgc tgggcgagta cgagcgacac tgcgattcca tcaactcgga ctttgggagc 60 
gagtccgggg gttgcgggga ctcgagtccg gggcctagcg ccagtcaggg gccgcgagcc 120 
ggcggcggcg cggcggagca ggaggaactg cactacatcc ccatccgcgt cctgggccgc 180 
ggcgccttcg gggaagccac gctgtaccgc cgcaccgagg atgactcact ggttgtgtgg 240 
aaggaagtcg atttgacccg gctgtctgag aaggaacgtc gtgatgcctt gaatgagatt 300 
gttattctgg cactgctgca gcatgacaac attattgcct actacaatca cttcatggac 360 
aataccacgc tgctgattga gctggaatat tgtaatggag ggaacctgta tgacaaaatc 420 
cttcgtcaga aggacaagtt gtttgaggaa gagatggtgg tgtggtacct atttcagatt 480 
gtttcagcag tgagctgcat ccataaagct ggaatccttc atagagatat aaagacatta 540 
aatatttttc tgaccaaggc aaacctgata aaacttggag attatggcct agcaaagaaa 600 
cttaattctg agtattccat ggctgagacg cttgtgggaa ccccatatta catgtctcca 660 
gagctctgtc aaggagtaaa gtacaatttc aagtctgata tctgggcagt tggctgcgtc 720 
atttttgaac tgcttacctt aaagaggacg tttgatgcta caaacccact taacctgtgt 780 
gtgaagatcg tgcaaggaat tcgggccatg gaagttgact ctagccagta ctctttggaa 840 
ttgatccaaa tggttcattc gtgccttgac caggatcctg agcagagacc tactgcagat 900 
gaacttctag atcgccctct tctcaggaaa cgcaggagag agatggagga aaaagtcact 960 
ctgcttaatg cacctacaaa gagaccaagg tcaagcactg tgactgaagc acccattgct 1020 
gtagtaacat cacgaaccag tgaagtctat gtttggggtg gtggaaaatc caccccccag 1080 
aaactggatg ttatcaagag tggctgtagt gcccggcagg tctgtgcagg gaatacccac 1 140 
tttgctgtgg tcacagtgga gaaggaactg tacacttggg tgaacatgca aggaggcact 1200 
aaactccatg gtcagctggg ccatggagac aaagcctcct atcgacagcc aaagcatgtg 1260 
gaaaagttgc aaggcaaagc tatccatcag gtgtcatgtg gtgatgattt cactgtctgt 1320 
gtgactgatg agggtcagct ctatgccttc ggatcagatt attatggctg catgggggtg 1380 
gacaaagttg ctggccctga agtgctagaa cccatgcagc tgaacttctt cctcagcaat 1440 
ccagtggagc aggtctcctg tggagataat catgtggtgg ttctgacacg aaacaaggaa 1500 
gtctattctt ggggctgtgg cgaatatgga cgactgggtt tggattcaga agaggattat 1560 
tatacaccac aaaaggtgga tgttcccaag gccttgatta ttgttgcagt tcaatgtggc 1620 
tgtgatggga catttctgtt gacccagtca ggcaaagtgc tggcctgtgg actcaatgaa 1680 
ttcaataagc tgggtctgaa tcagtgcatg tcgggaatta tcaaccatga agcataccat 1740 
gaagttccct acacaacgtc ctttaccttg gccaaacagt tgtcctttta taagatccgt 1 800 
accattgccc caggcaagac tcacacagct gctattgatg agcgaggccg gctgctgacc 1860 
tttggctgca acaagtgtgg gcagctgggc gttgggaact acaagaagcg tctgggaatc 1920 
aacctgttgg ggggacccct tggtgggaag caagtgatca gggtctcctg cggtgatgag 1980 
tttaccattg ctgccactga tgataatcac atttttgcct ggggcaatgg tggtaatggc 2040 
cgcctggcaa tgacccccac agagagacca catggctctg atatctgtac ctcatggcct 2100 
cggcctattt ttggatctct gcatcatgtc ccggacctgt cttgccgtgg atggcatacc 2160 
attctcatcg ttgagaaagt attgaattct aagaccatcc gttccaatag cagtggctta 2220 
tccattggaa ctgtgtttca gagctctagc ccgggaggag gcggcggggg cggcggtggt 2280 
gaagaagagg acagtcagca ggaatctgaa actcctgacc caagtggagg cttccgagga 2340 
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acaatggaag cagaccgagg aatggaaggt ttaatcagtc ccacagaggc catggggaac 2400 
agtaatgggg ccagcagctc ctgtcctggc tggcttcgaa aggagctgga aaatgcagaa 2460 
tttatcccca tgcctgacag cccatctcct ctcagtgcag cgttttcaga atctgagaaa 2520 
gataccctgc cctatgaaga gctgcaagga ctcaaagtgg cctctgaagc tcctttggaa 2580 
cacaaacccc aagtagaagc ctcgtcacct cggctgaatc ctgcagtaac ctgtgctaaa 2640 
aagggaacac cactgactcc tcctgcgtgt gcgtgcagct ctctgcaggt ggaggttgas 2700 
agattgcagg gtctggtgtt aaagtgtccg gctgaacaac agaagctaca gcaagaaaac 2760 
ctccagattt ttacccaact gcagaagttg aacaagaaat tagaaggagg gcagcaggtg 2820 
gggatgcatt ccaaaggaac tcagacagca aaggaagaga tggaaatgga tccaaagcct 2880 
gacttagatt cagattcctg gtgcctcctg ggaacagact cctgtagacc cagcctctag 2940 

<210>4 

<211>979 

<212>PRT 

<2 1 3> Homo sapiens 

<400> 4 

Met Ser Val Leu Gly Glu Tyr Glu Arg His Cys Asp Ser He Asn Ser 
15 10 15 

Asp Phe Gly Ser Glu Ser Gly Gly Cys Gly Asp Ser Ser Pro Gly Pro 
20 25 30 

Ser Ala Ser Gin Gly Pro Arg Ala Gly Gly Gly Ala Ala Glu Gin Glu 
35 40 45 

Glu Leu His Tyr He Pro lie Arg Val Leu Gly Arg Gly Ala Phe Glv 
50 55 60 

Glu Ala Thr Leu Tyr Arg Arg Thr Glu Asp Asp Ser Leu Val Val Tro 
65 70 75 80 

Lys Glu Val Asp Leu Thr Arg Leu Ser Glu Lys Glu Arg Arg Asp Ala 
85 90 95 

Leu Asn Glu He Val He Leu Ala Leu Leu Gin His Asp Asn He He 
100 105 110 

Ala Tyr Tyr Asn His Phe Met Asp Asn Thr Thr Leu Leu He Glu Leu 
115 120 125 

Glu Tyr Cys Asn Gly Gly Asn Leu Tyr Asp Lys He Leu Arg Gin Lys 
130 135 140 

Asp Lys Leu Phe Glu Glu Glu Met Val Val Trp Tyr Leu Phe Gin He 
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Val Ser Ala Val Ser Cys He His Lys Ala Gly He Leu His Arg Asp 
165 170 175 

He Lys Thr Leu Asn He Phe Leu Thr Lys Ala Asn Leu He Lys Leu 
ISO 185 190 

Gly Asp Tyr Gly Leu Ala Lys Lys Leu Asn Ser Glu Tyr Ser Met Ala 
195 200 205 

Glu Thr Leu Val Gly Thr Pro Tyr Tyr Met Ser Pro Glu Leu Cys Gin 
210 215 220 

Gly Val Lys Tyr Asn Phe Lys Ser Asp He Trp Ala Val Gly Cys Val 
225 " 230 235 240 

He Phe Glu Leu Leu Thr Leu Lys Arg Thr Phe Asp Ala Thr Asn Pro 
245 250 255 

Leu Asn Leu Cys Val Lys He Val Gin Gly He Arg Ala Met Glu Val 
260 265 270 

Asp Ser Ser Gin Tyr Ser Leu Glu Leu lie Gin Met Val His Ser Cys 
275 280 285 

Leu Asp Gin Asp Pro Glu Gin Arg Pro Thr Ala Asp Glu Leu Leu Asp 
290 295 300 

Arg Pro Leu Leu Arg Lys Arg Arg Arg Glu Met Glu Glu Lys Val Thr 
305 310 315 320 

Leu Leu Asn Ala Pro Thr Lys Arg Pro Arg Ser Ser Thr Val Thr Glu 
325 330 335 

Ala Pro He Ala Val Val Thr Ser Arg Thr Ser Glu Val Tyr Val Trp 
340 345 350 

Gly Gly Gly Lys Ser Thr Pro Gin Lys Leu Asp Val He Lys Ser Gly 
355 360 365 

Cys Ser Ala Arg Gin Val Cys Ala Gly Asn Thr His Phe Ala Val Val 
370 375 380 

Thr Val Glu Lys Glu Leu Tyr Thr Trp Val Asn Met Gin Gly Gly Thr 
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Lys Leu His Gly Gin Leu Gly His Gly Asp Lys Ala Ser Tyr Arg Gin 
405 410 415 

Pro Lys His Val Glu Lys Lou Gin Gly Lvs Ala lie His Gin Val Scr 
420 425 430 

Cys Gly Asp Asp Phe Thr Val Cys Val Thr Asp Glu Gly Gin Leu Tyr 
435 440 445 

Ala Phe Gly Ser Asp Tyr Tyr Gly Cys Met Gly Val Asp Lys Val Ala 
450 455 460 

Gly Pro Glu Val Leu Glu Pro Met Gin Leu Asn Phe Phe Leu Ser Asn 
465 470 475 480 

Pro Val Glu Gin Val Ser Cys Gly Asp Asn His Val Val Val Leu Thr 
485 490 495 

Arg Asn Lys Glu Val Tyr Ser Trp Gly Cys Gly Glu Tyr Gly Arg Leu 
500 505 510 

Gly Leu Asp Ser Glu Glu Asp Tyr Tyr Thr Pro Gin Lys Val Asp Val 
515 520 525 

Pro Lys Ala Leu lie He Val Ala Val Gin Cys Gly Cys Asp Gly Thr 
530 535 540 

Phe Leu Leu Thr Gin Ser Gly Lys Val Leu Ala Cys Gly Leu Asn Glu 
545 550 555 560 

Phe Asn Lys Leu Gly Leu Asn Gin Cys Met Ser Gly He He Asn His 
565 570 575 

Glu Ala Tyr His Glu Val Pro Tyr Thr Thr Ser Phe Thr Leu Ala Lys 
580 585 590 

Gin Leu Ser Phe Tyr Lys He Arg Thr He Ala Pro Gly Lys Thr His 
595 600 605 

Thr Ala Ala He Asp Glu Arg Gly Arg Leu Leu Thr Phe Gly Cys Asn 
610 615 620 

Lys Cys Gly Gin Leu Gly Val Gly Asn Tyr Lys Lys Arg Leu Gly He 
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625 630 635 640 

Asn Leu Leu Gly Gly Pro Leu Gly Gly Lys Gin Val He Arg Val Ser 

645 650 655 

Cys Gly Asp Glu Phe Thr lie Ala Ala Thr Asp Asp Asn His He Phe 
660 665 670 

Ala Trp Gly Asn Gly Gly Asn Gly Arg Leu Ala Met Thr Pro Thr Glu 
675 680 685 

Arg Pro His Gly Ser Asp He Cys Thr Ser Trp Pro Arg Pro He Phe 
690 695 700 

Gly Ser Leu His His Val Pro Asp Leu Ser Cys Arg Gly Trp His Thr 
705 710 715 720 

He Leu He Val Glu Lys Val Leu Asn Ser Lys Thr He Arg Ser Asn 
725 730 735 

Ser Ser Gly Leu Ser lie Gly Thr Val Phe Gin Ser Ser Ser Pro Gly 
740 745 750 

Gly Gly Gly Gly Gly Gly Gly Gly Glu Glu Glu Asp Ser Gin Gin Glu 
755 760 765 

Ser Glu Thr Pro Asp Pro Ser Gly Gly Phe Arg Gly Thr Met Glu Ala 
770 775 780 

Asp Arg Gly Met Glu Gly Leu He Ser Pro Thr Glu Ala Met Gly Asn 
785 790 795 800 

Ser Asn Gly Ala Ser Ser Ser Cys Pro Gly Trp Leu Arg Lys Glu Leu 
805 810 815 

Glu Asn Ala Glu Phe He Pro Met Pro Asp Ser Pro Ser Pro Leu Ser 
820 825 830 

Ala Ala Phe Ser Glu Ser Glu Lys Asp Thr Leu Pro Tyr Glu Glu Leu 
835 840 845 

Gin Gly Leu Lys Val Ala Ser Glu Ala Pro Leu Glu His Lys Pro Gin 
850 855 860 

Val Glu Ala Ser Ser Pro Arg Leu Asn Pro Ala Val Thr Cys Ala Gly 
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865 870 875 880 

Lys Gly Thr Pro Leu Thr Pro Pro Ala Cys Ala Cys Ser Ser Leu Gin 
885 890 895 

Val Glu Val Glu Arg Leu Gin Gly Leu Val Leu Lvs Cvs Leu Ala Glu 
900 905 910 

Gin Gin Lys Leu Gin Gin Glu Asn Leu Gin He Phe Thr Gin Leu Gin 
915 920 925 

Lys Leu Asn Lys Lys Leu Glu Gly Gly Gin Gin Val Gly Met His Ser 
930 935 940 

Lys Gly Thr Gin Thr Aia Lys Glu Glu Met Glu Met Asp Pro Lys Pro 
945 950 955 960 

Asp Leu Asp Ser Asp Ser Trp Cys Leu Leu Gly Thr Asp Ser Cys Arg 
965 970 975 

Pro Ser Leu 



<210>5 
<211>21 
<212>DNA 
<213>Mussp. 

<400> 5 

ccggtggatg tggaatgtgt g 2 1 

<210>6 
<211>22 
<212>DNA 
<213>Mus sp. 

<400> 6 

caaagccaag gttcattcgg tg 22 

<210>7 
<211>22 
<212>DNA 
<213>Mussp. 
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1 _> 

<'400> 7 

gccctgaatg aactgcagga eg 22 

<210> 8 
<21 1> 22 
<2 1 2> DNA 
<2 13> Mus sp. 

<400> 8 

caegggtage caaegctatg tc 22 

<210> 9 
<211>21 
<2 1 2> DNA 
<213> Mus sp. 

<400> 9 

cttccgcttc cacgacactc g 2 1 

<210> 10 
<211>21 
<212>DNA 
<213> Mus sp. 

<400> 10 

ctcaatggcc tcagacgcca g 2 1 

<210> 11 

<211>20 

<212>DNA 

<213> Homo sapiens 

<400> 1 1 

gectttggae aagcacacac 20 

<210> 12 

<211> 19 

<212>DNA 

<213> Homo sapiens 

<400> 12 

ctccttcagc tcctgggcc 19 

<210> 13 
<211>30 
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1 A- 

<212> DNA 
<213> Homo sapiens 

<400> 13 

cgagctgctc tatagactgc tgggtagtcc 30 

<210> 14 
<21 1> 30 
<212> DNA 
<213> Homo sapiens 

<400> 14 

taacagaggt ggcttatgag tatttcttcc 30 

<210>15 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 15 

aaaccacaag aaggtggctg 20 

<210> 16 

<211>22 

<212>DNA 

<2 1 3> Homo sapiens 

<400> 16 

aggtgaagcg gctgtcccac ga 22 

<210> 17 
<211>22 
<212>DNA 
<213> Homo sapiens 

<400> 17 

ctccttcagc tcctgggcca ca 22 

<210> 18 
<211> 821 
<212> PRT 
<213> Homo sapiens 

<400> 18 

Met Ala Ala Glu Glu Val Leu Gin Thr Val Asp His Tyr Lys Thr Glu 
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15 

20 ^ SG ,U Leu f hr Glu Thr Thr i^- 

50 *-A spGluLeuGiu AIa GiuT 

60 y SpSerLe "LysGJn 

Ar S Lys VaJ A/ a n, t 

G -Se, A , SerLy ^ 

l<b , ]0 yLySl,eLe "G]u M et 

Gin Asn GJu Leu Lys G ] n c A 

115 120 CrArgAia Valval Th a 

120 ]25 VaiThr A S nVaJGInAJ a 

G ^AsnGi uAr LeuThrA 

]4 /alG,nAsp LeuLysGiuAsnAsn 
f--e t Va J? LeuG]nArgi]e 

/U 175 y lhrG IuLeuGlu 

Gi "GluA S nJleThrLenr, r 

185 190 ihrLeuL ysGlnA Sn 

G^ValGluTyrGIun r 

195 20n yLeULysH ^GluIJei v . 



'A2 ( > 



WO 00/36097 

16 PCT/US99/29989 

245 250 255 



Ser LeuAsn Asp Asn His Ile Ser I^Ser Val Asp Gly Leu Lys Phe 

Ala GH, Asp G.y Ser G,« Pre ^ Asp Asp Lys Met ^ ^ ^ 
He Hjs Gly Pro Le, Val Lys G,y Asp Ty r Ar g Thr Pro Thr 

305 ^O 0 ' 11 Pr ° Val Ser Asp Leu Phe S er Glu 



320 



Leu Asn ,,e ■ Ser G .u I,e Gln L>, Leu L y s Gln Gln Lou Mc , Q|n 

*° 3j>0 335 

G.„ Arg Glu Lys A,a He Leu Leu Ala Asn Leu Gin G,u Ser G,„ Thr 
G,n Leu G,u His Tta L ys G, y AtaLeu Thr Glu Gln Hls G|u ^ v- 
HisArg Leu Thr Glu His Va, AsnAla Me, Arg Gly Leu Gi „ Ser Ser 
Lys G1 u Leu L y s Ala G,u Leu Asp Gly G, u Lys GIy Ar g Asp Ser G, y 



400 



G,u G,u Ala His Asp Tyr G1 u Val Asp He Asn G, y Leu Giu ,le Leu 

H}JD 410 415 

Glu Cys Lys Tyr Arg Va, Ala Va, Thr G,u Val He Asp Leu Lys Ala 
G,u „e Lys A! a LeuLys G,u Lys Tyr Asn Lys Ser Va, Glu Asn Tyr 
Thr Asp Glu Lys Ah, Lys Tyr Gh,Ser Lys He G,„ Me t Tyr Asp Glu 

Gln Va, Thr Ser Leu G,u Lys Thr Thr L y s Glu Ser Gly Glu L y s Met 

475 480 

Ala His Met Glu Lys Glu Leu Gln Lys Met Thr Ser Ile Ala Asn Glu 
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485 490 495 

Asn His Ser Thr Leu Asn Thr Ala Gin Asp Glu Leu Val Thr Phe Ser 
500 505 510 

Glu Glu Leu Ala Gin Leu Tyr His His Val Cys Leu Cys Asn Asn Glu 
515 520 525 

Thr Pro Asn Arg Val Met Leu Asp Tyr Tyr Arg Gin Ser Arg Val Thr 
530 535 540 

Arg Ser Gly Ser Leu Lys Gly Pro Asp Asp Pro Arg Gly Leu Leu Ser 
545 550 555 560 

Pro Arg Leu Ala Arg Arg Gly Val Ser Ser Pro Val Glu Thr Arg Thr 
565 570 575 

Ser Ser Glu Pro Val Ala Lys Glu Ser Thr Glu Pro Ser Lys Glu Pro 
580 585 590 

Ser Pro Thr Lys Thr Pro Thr He Ser Pro Val He Thr Ala Pro Pro 
595 600 605 

Ser Ser Pro Val Leu Asp Thr Ser Asp lie Arg Lys Glu Pro Met Asn 
610 615 620 

He Tyr Asn Leu Asn Ala He He Arg Asp Gin He Lys His Leu Gin 
625 630 635 640 

Lys Ala Val Asp Arg Ser Leu Gin Leu Ser Arg Gin Arg Ala Ala Ala 
645 650 655 

Arg Glu Leu Ala Pro Met He Asp Lys Asp Lys Glu Ala Leu Met Glu 
660 665 670 

Glu He Leu Lys Leu Lys Ser Leu Leu Ser Thr Lys Arg Glu Gin lie 
675 680 685 

Ala Thr Leu Arg Ala Val Leu Lys Ala Asn Lys Gin Thr Ala Glu Val 
690 695 700 

Ala Leu Ala Asn Leu Lys Asn Lys Tyr Glu Asn Glu Lys Ala Met Val 
705 710 715 720 

Thr Glu Thr Met Thr Lys Leu Arg Asn Glu Leu Lys Ala Leu Lys Glu 
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725 730 735 

Asp Ala Ala Thr Phe Ser Ser Leu Arg Thr Met Phe Ala Thr Arg Cys 
740 745 750 

Asp Glu Tyr V'al Thr Gin Leu Asp Glu Met Gin Arg Gin Leu Ala Ala 
755 760 765 

Ala Glu Asp Glu Lys Lys Thr Leu Asn Thr Leu Leu Arg Met Ala He 
770 775 780 

Gin Gin Lys Leu Ala Leu Thr Gin Arg Leu Glu Asp Leu Glu Phe Asp 
785 790 795 800 

His Glu Gin Ser Arg Arg Ser Lys Gly Lys Leu Gly Lys Ser Lys He 
805 810 815 

Gly Ser Pro Lys Vai 
820 



<210> 19 

<211>868 

<212> PRT 

<2 1 3> Homo sapiens 

<400> 19 

Asn Thr His Leu Glu Ala Gin Leu Gin Lys Ala Glu Glu Ala Gly Ala 
15 10 15 

Glu Leu Gin Ala Asp Leu Arg Asp He Gin Glu Glu Lys Glu Glu He 
20 25 30 

Gin Lys Lys Leu Ser Glu Ser Arg His Gin Gin Glu Ala Ala Thr Thr 
35 40 45 

Gin Leu Glu Gin Leu His Gin Glu Ala Lys Arg Gin Glu Glu Val Leu 
50 55 60 

Ala Arg Ala Val Gin Glu Lys Glu Ala Leu Val Arg Glu Lys Ala Ala 
65 70 75 80 

Leu Glu Val Arg Leu Gin Ala Val Glu Arg Asp Arg Gin Asp Leu Ala 
85 90 95 
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Glu Gin Leu Gin Gly Leu Ser Ser Ala Lys Glu Leu Leu Glu Ser Ser 
100 105 110 

Leu Phe Glu Ala Gin Gin Gin Asn Ser Val He Glu Val Thr Lys Gly 
115 120 125 

Gin Leu Glu Val Gin He Gin Thr Val Thr Gin Ala Lys Glu Val He 
130 135 140 

Gin Gly Glu Val Arg Cys Leu Lys Leu Glu Leu Asp Thr Glu Arg Ser 
145 150 155 160 

Gin Ala Glu Gin Glu Arg Asp Ala Ala Ala Arg Gin Leu Ala Gin Ala 
165 170 175 

Glu Gin Glu Gly Lys Thr Ala Leu Glu Gin Gin Lys Ala Ala His Glu 
180 185 190 

Lys Glu Val Asn Gin Leu Arg Glu Lys Trp Glu Lys Glu Arg Ser Tip 
195 200 205 

/ 

His Gin Gin Glu Leu Ala Lys Ala Leu Glu Ser Leu Glu Arg Glu Lys 
210 215 220 

Met Glu Leu Glu Met Arg Leu Lys Glu Gin Gin Thr Glu Met Glu Ala 
225 230 235 240 

He Gin Ala Gin Arg Glu Glu Glu Arg Thr Gin Ala Glu Ser Ala Leu 
245 " 250 255 

Cys Gin Met Gin Leu Glu Thr Glu Lys Glu Arg Val Ser Leu Leu Glu 
260 265 270 

Thr Leu Leu Gin Thr Gin Lys Glu Leu Ala Asp Ala Ser Gin Gin Leu 
275 280 285 

Glu Arg Leu Arg Gin Asp Met Lys Val Gin Lys Leu Lys Glu Gin Glu 
290 295 300 

Thr Thr Gly He Leu Gin Thr Gin Leu Gin Glu Ala Gin Arg Glu Leu 
305 310 315 320 

Lys Glu Ala Ala Arg Gin His Arg Asp Asp Leu Ala Ala Leu Gin Glu 
325 330 335 
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Glu Ser Ser Scr Leu Leu Gin Asp Lys Met Asp Leu Gin Lys Gin Val 
340 345 350 

Glu Asp Leu Lys Ser Gin Leu Val Ala Gin Asp Asp Ser Gin Arg Leu 

355 360 365 



Val Glu Gin Glu Val Gin Glu Lys Leu Arg Glu Thr Gin Glu Tyr Asn 
370 375 380 



Arg He Gin Lys Glu Leu Glu Arg Glu Lys Ala Ser Leu Thr Leu Ser 
385 390 395 400 

Leu Met Glu Lys Glu Gin Arg Leu Leu Val Leu Gin Glu Ala Asp Ser 
405 410 415 

He Arg Gin Gin Glu Leu Ser Ala Leu Arg Gin Asp Met Gin Glu Ala 
420 425 430 

Gin Gly Glu Gin Lys Glu Leu Ser Ala Gin Met Glu Leu Leu Arg Gin 
435 440 445 

Glu Val Lys Glu Lys Glu Ala Asp Phe Leu Ala Gin Glu Ala Gin Leu 
450 455 460 

Leu Glu Glu Leu Glu Ala Ser His He Thr Glu Gin Gin Leu Arg Ala 
465 470 475 480 

Ser Leu Trp Ala Gin Glu Ala Lys Ala Ala Gin Leu Gin Leu Arg Leu 
485 490 495 

Arg Ser Thr Glu Ser Gin Leu Glu Ala Leu Ala Ala Glu Gin Gin Pro 
500 505 510 

Gly Asn Gin Ala Gin Ala Gin Ala Gin Leu Ala Ser Leu Tyr Ser Ala 
515 520 525 

Leu Gin Gin Ala Leu Gly Ser Val Cys Glu Ser Arg Pro Glu Leu Ser 
530 535 540 

Gly Gly Gly Asp Ser Ala Pro Ser Val Trp Gly Leu Glu Pro Asp Gin 
545 550 555 560 

Asn Gly Ala Arg Ser Leu Phe Lys Arg Gly Pro Leu Leu Thr Ala Leu 
565 570 575 
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Ser Ala Glu Ala Val Ala Ser Ala Leu His Lys Leu His Gin Asp Leu 
580 585 590 

Trp Lys Thr Gin Gin Thr Arg Asp Val Leu Arg Asp Gin Val Gin Lys 
595 600 605 

Leu Glu Glu Arg Leu Thr Asp Thr Glu Ala Glu Lys Ser Gin Val His 
610 615 620 

Thr Glu Leu Gin Asp Leu Gin Arg Gin Leu Ser Gin Asn Gin Glu Glu 
625 630 635 640 

Lys Ser Lys Trp Glu Gly Lys Gin Asn Ser Leu Glu Ser Glu Leu Met 
645 650 655 

Glu Leu His Glu Thr Met Ala Ser Leu Gin Ser Arg Leu Arg Arg Ala 
660 665 670 

Glu Leu Gin Arg Met Glu Ala Gin Gly Glu Arg Glu Leu Leu Gin Ala 
675 680 685 

Ala Lys Glu Asn Leu Thr Ala Gin Val Glu His Leu Gin Ala Ala Val 
690 695 700 

Val Glu Ala Arg Ala Gin Ala Ser Ala Ala Gly He Leu Glu Glu Asp 
705 710 715 720 

Leu Arg Thr Ala Arg Ser Ala Leu Lys Leu Lys Asn Glu Glu Val Glu 
725 730 735 

Ser Glu Arg Glu Arg Ala Gin Ala Leu Gin Glu Gin Gly Glu Leu Lys 
740 745 750 

Val Ala Gin Gly Lys Ala Leu Gin Glu Asn Leu Ala Leu Leu Thr Gin 
755 760 765 

Thr Leu Ala Glu Arg Glu Glu Glu Val Glu Thr Leu Arg Gly Gin He 
770 775 780 

Gin Glu Leu Glu Lys Gin Arg Glu Met Gin Lys Ala Ala Leu Glu Leu 
785 790 795 800 

Leu Ser Leu Asp Leu Lys Lys Arg Asn Gin Glu Val Asp Leu Gin Gin 
805 810 815 
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Glu Gin He Gin Glu Leu Glu Lys Cys Arg Ser Val Leu Glu His Leu 
820 825 830 

Pro Met Ala Val Gin Glu Arg Glu Gin Lys Leu Thr Val Gin Arg Glu 
835 840 845 

Gin He Arg Glu Leu Glu Lys Asp Arg Glu Thr Gin Arg Asn Val Leu 
850 855 860 

Glu His Gin Leu 
865 
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